FIG. 1A 

1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60 

61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120 

121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180 

181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 

241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300 
x MA RAHW GCC PWL 1^ 

301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360 
13VLLC ACA WGH TKPL DLG GQD 32 

361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420 
33VRNC STN PPY LPVT VVN 52 

421 TCACTCACAG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480 
53SLTA LRQ QMQ TQNL SAY IIP 72 

481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG AC GAGAGGC G TGCGTGGATT 540 
73GTDA HMN EYI GQHD ERR AWI 92 

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600 
93TGFT GSA GTA VVTM KKA AVW 112 

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660 
113TDSR YWT QAE RQMD CNW 132 



661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC CGCTGGAGGG 720 
133KEVG TTP IVT WLLT EIP AGG 152 

721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780 
153RVGF DPF LLS IDTW ESY 172 



781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 
173 Li Q G S NRQ LVS ITTN LVD LVW 192 

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900 
193GSER PPV PNQ PIYA LQE AFT 212 

901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 
213GSTW QEK VSG VRSQ MQK HQK 2Ji5 



FIG. 1B 



961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020 
233 VPTA VLL SAL EETA W L F NLR 252 



• ••••• 

1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
253 ASDI PYN PFF YSYT L L T DSS 272 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 I R L F A N K SRF SSET LSY LNS 292 



• ••••• 
1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 1200 

293 SCTG PMC VQI EDYS QVR DSI 312 

• ••••• 
1201 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 1260 

313 QAYS L G D VRI WIGT SYT MYG 332 

1261 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 1320 
333 IYEM IPR EKL VTDT YSP VMM 352 



1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380 
353 TKAV KNS KEQ A L L K ASH VRD 372 

• ••••• 

1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440 
373 AVAV IRY LVW LEKN VPK GTV 392 



• • • 

1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500 
393 DEFS GAE IVD KFRG EEQ FSS 412 

1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560 
413 GPSF ETI SAS GL.NA ALA HYS 432 



1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620 

433 PTKE LNR KLS SDEM YLL DSG 452 

• ••••• 

1621 GGGCAGTACT GGGACGGGAC CACAGACATC ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 

453 GQYW DGT TDI TR TV HWG TPS 472 

»•••*• 

1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 

473 AFQK EAY TRV LIGN IDL SRL 492 

• 

1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 

493 IFPA ATS GRM VEAF ARR ALW 512 



1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513DAGL NYG HGT GHGI GNF LCV 532 



FIG. 1C 

1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920 
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980 
553 SIEP GYY KDG EFGI RLE DVA 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2040 
573 Li V V E AKT KYP GELP D Ii V VSF 592 

2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCCGAGCA TCTCCAGTAC 2100 
593 VPYD RNL IDV SLLS PEH LQY 612 

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 
613LNRY YQT IRE KVGP ELQ RRQ 632 

• • * 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220 
633 LLEE FEW L Q Q HTEP L A A RAP 652 

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 2280 
653 DTAS WAS VLV VSTL AIL GWS 672 

2281 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2340 
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 
2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 
2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520 
2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 2580 
2581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640 
2641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700 

2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760 

• • m 

2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820 

2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880 

2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940 



FIG. 1D 

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060 
3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120 

3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 

. • • • • • 

3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 

• •••*• 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300 

3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360 

3361 CGTTAGC C AC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420 

3421 GCCTGTGG 3428 



FIG. 2A 



• * • 

1 CACCCTATCC TACACTACTA GGAACTTGCA 



61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA 



. • • 

121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC 

181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC 

241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC 
1 MA 



• • * 

301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC 
13VLLC ACA WGH 

361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC 
33VRNC STN PPY 



421 TCACTCACAG CCCTCCGCCA GCAGATGCAG 
53SLTA LRQ QMQ 

• 

481 GGCACAGATG CTCACATGAA CGAGTACATC 
73GTDA HMN EYI 

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA 
93TGFT GSA GTA 

• • • 
601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG 
113 TDSR YWT QAE 

• • • 
661 AAGGAAGTTG GCACCACTCC TATTGTCACC 
133 KEVG TTP IVT 

721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC 
153 RVGF DPF LLS 

♦ 

781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC 
173 LQGS NRQ LVS 

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA 
193 GSER PPV PNQ 

• • • 

901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC 
213 GSTW QEK VSG 



• • • 

CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60 



• • ♦ 

ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120 



• • • 

CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180 



• • • 

CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300 
RAHW GCC PWL 12 



ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360 
TKPL DLG GQD 32 



• • • 

CTTCCAGTTA CTGTGGTCAA TACCACAATG 420 
LPVT VVN TTM 52 



• • • 
ACCCAGAATC TCTCAGCCTA CATCATCCCA 480 
TQNL SAY IIP 72 

• • • 
GGCCAACATG AC G AGAGGC G TGCGTGGATT 540 
GQHD ERR AWI 92 

GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600 
VVTM KKA AVW 112 



CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660 
RQMD CNW ELH 132 

TGGCTCCTCA CCGAGATTCC CGCTGGAGGG 720 
WLLT EIP AGG 152 

• • • 

ATTGACACCT GGGAGAGTTA TGATCTGGCC 780 
IDTW ESY DLA 172 



ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 
ITTN LVD LVW 192 



• • • 

CCCATTTATG CCCTGCAGGA GGCATTCACA 900 
PIYA LQE AFT 212 



GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 
VRSQ MQK HQK 232 



FIG. 2B 



• • • 

961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT 
233 VPTA VLL SAL 



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC 
253 ASDI PYN PFF 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT 
273 IRLF ANK SRF 

1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC 
293 SCTG PMC VQI 



• • • 

1201 CAGGCCTACT CATTGGGAGA TGTGAGGATC 
313 QAYS LGD VRI 



1261 ATCTATGAAA TGATACCAAG GGAGAAACTC 
333 IYEM IPR EKL 



• • • 

1321 AC C AAGGC AG TGAAGAACAG CAAGGAGCAG 
353 TKAV KNS KEQ 



1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG 
373 AVAV IRY LVW 



1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC 
393 DEFS GAE IVD 

• * • 

1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT 
413 GPSF ETI SAS 



1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC 
433 PTKE LNR KLS 

1621 GGGCAGTACT GGGACGGGAC CACAGACATC 
453 GQYW DGT TDI 

« • • 

1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG 
473 AFQK EAY TRV 

1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG 
493 IFPA ATS GRM 



• • • 

GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020 
EETA WLF NLR 252 



TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
YSYT LLT DSS 272 



AGCTCCGAAA < CCTTGAGCTA TCTGAACTCC 1140 
SSET L S Y L N S 292 



GAGGATTACA GCCAAGTTCG TGACAGCATC 1200 
EDYS QVR DSI 312 



TGGATTGGGA CCAGCTATAC CATGTATGGG 1260 
WIGT SYT MYG 332 



• • • 

GTGACAGACA CCTACTCCCC AGTGATGATG 1320 
VTDT YSP VMM 352 



• ■ • 

GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380 
A L L K ASH VRD 372 



• • • 
CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440 
LEKN VPK GTV 392 

• • • 
AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500 
KFRG EEQ FSS 412 



GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560 
GLNA ALA HYS 432 



TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620 
SDEM YLL DSG 452 

• • • 
ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 
TRTV HWG TPS 472 

CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 
LIGN IDL SRL 492 

• • • 
GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 
VEAF ARR ALW 512 



1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 



FIG. 2C 

1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920 
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980 
553 S I E P GYY KDG EFGI RLE DVA 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2040 
573 L V V E AKT KYP GELP D L V VSF 592 

m 

2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCGGAGCA TCTCCAGTAC 2100 
593 VPYD RNL IDV S L L S PEH LQY 612 

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 
613LNRY YQT IRE KVGP ELQ RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220 
633 L L E E FEW LQQ HTEP LAA RAP 652 

. • • 

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 2280 
653 DTAS WAS VLV VSTL AIL GWS 672 

2281 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2340 
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 
2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 
2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520 
2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 2580 
2581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640 
2641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700 
2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760 
2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820 
2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880 
2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940 



FIG. 2D 

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 

• ••••• 
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060 

3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120 

3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTT CAGAC 3180 

• ••••• 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 

• ••••• 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300 

3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360 

• ••••• 
3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420 

3421 GCCTGTGG 3428 



FIG. 3A 



1 CTGTGCATGG CATCATCCTG GCCCCCTCTA 
1 MASSWPPL 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT 
19FPQN ATA CDN 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT 
39 V L P T FII SIC 



181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA 
59 L L V F LLP RRQ 



241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG 
79LAAS D LV FVL 

• 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC 
99NQFN WPF GAL 

• 

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG 
119 ANLF ISI FLV 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG 
139 LVHP MAS GRQ 

. . • 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG 
159 VLIW VVG GLL 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC 
179 QAVP DLN ITA 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT 
199 HFAR IVE L N I 



, • • 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC 
219 VFFN YHI LAS 

. • • 

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC 
239 RVRG PKD SKT 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC 
259 FLVC WAP YHF 



. • • 

GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
ELQS SNQ SQL 18 

• • • 

GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
APEA WDL LHR 38 

TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
FFGL LGN LFV 58 

• 

CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
LNVA EIY LAN 78 



GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
GLPF WAE NIW 98 

■ • * 

CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
LCRV I N G VIK 118 

GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
VAIS QDR YRV 138 



• • • 
CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
QRRR QAR VTC 158 

• • • 
AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
SIPT FLL RSI 178 

• 

TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 
CILL LPH EAW 198 

CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
LGFL LPL A A I 218 



• • • 

CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
LRTR EEV SRT 238 



ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
TALI LTL VVA 258 



TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
FAFL EFL FQV 278 



841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 QAVR GCF WED FIDL GLQ LAN 298 



FIG. 3B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT 
299 FFAF TNS SLN 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA 
319 FRTK VWE LYK 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA 
339 SSSH RKE IFQ 



CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
PVIY VFV GRL 318 

CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
QCTP KSL API 338 

CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
L F W R N * 353 



1081 CC 1082 



FIG. 4A 



1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL E L Q S S N Q SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19FPQN ATA CDN APEA WDL LHR 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FII SI C FFGL LGN LFV 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
79LAAS DLV FVL GLPF WAE NIW 98 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
99NQFN WPF GAL LCRV ING VIK 118 

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
119ANLF ISI FLV VAIS QDR 138 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139LVHP MAS GRQ Q R R R QAR VTC 158 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159VLIW VVG GLL SIPT FLL RSI 178 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 
179 Q A V P DLN ITA CILL LPH 198 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199HFAR IVE LNI LGFL LPL 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219VFFN YHI LAS LRTR EEV 238 

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 RVRG PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL F QV 278 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 Q A V R GCF WED FIDL GLQ LAN 298 



FIG. 4B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT 
299 FFAF TNS SLN 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA 
319 FRTK VWE LYK 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA 
339 SSSH RKE IFQ 



CCAGTAATTT ATGTCTTTGT GGGCCAGCTC 960 
PVIY VFV G 2 Ij 318 

CAATGCACCC CTAAAAGTCT TGCTC CAATA 1020 
QCTP KSL API 338 

CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
L F W R N * 353 



1081 CC 1082 



FIG. 5A 



1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL E L Q S S N Q SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19FPQN ATA CDN APEA WDL LHR 38 

121 GTGCTGCCAA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FII SIC FFGL LGN LFV 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
79LAAS DLV FVL GLPF WAE NIW 98 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
99NQFN WPF GAL LCRV ING VIK 118 

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
119ANLF ISI FLV VAIS QDR YRV 138 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139LVHP MAS GRQ QRRR QAR VTC 158 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159VLIW VVG GLL SIPT FLL RSI 178 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 
179 Q A V P D L N I T A CILL LPH EAW 198 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199HFAR IVE LNI LGFL LPL AAI 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219VFFN Y H I LAS LRTR E E V SRT 238 

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 RVRG PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 

• * ' 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 Q A V R GCF WED FIDL GLQ LAN 298 



FIG. 5B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
319FRTK VWE LYK QCTP KSL API 338 

, • • * 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE IFQ LFWR N* 353 



1081 CC 1082 



FIG. 6A 



1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60 
1MFSP WKI SMF LSVR EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120 
21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
41TFAQ SKC PQV EWLG WLN TIQ 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61PPFL WVL FVL ATLE NIF VLS 80 

241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
81VFCL HKS SCT VAEI YLG NLA 100 



301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360 
101 A A D L I L A CGL PFWA ITI SNN 120 



3 61 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420 
121FDWL FGE TLC RVVN All SMN 140 



421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141LYSS ICF L M L VSID RYL ALV 160 



481 AAAAC CATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
161KTMS MGR MRG VRWA KLY SLV 180 



m 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181IWGC TLL LSS PMLV FRT MKE 200 



601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201YSDE GHN VTA CVIS YPS LIW 220 



661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
221EVFT N M I» LNV VGFL LPL SVI 240 



721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780 
241TFCT MQI MQV LRNN EMQ KFK 260 



781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
261EIQT ERR A T V LVLV VLL LFI 280 

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281ICWL PFQ 1ST FLDT LHR LGI 300 



FIG. 6B 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 
301LSSC QDE RII DVIT " ' 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321AYSN SCL. NPL V Y V I 

*s rrr; -rr? c rrr rrr G tft ~r sr 

,031 a™gG AGAACXCCA, GGGCACAC.G CGGACCJCCA TCTCCGTGGA ACGCCAGATT 1140 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381HKLQ DWA GSR Q 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 
1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320 
1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 
1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGC C AC ACCTGAGCCA 1440 
1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 
1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 
1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGC CAT CTCCATCTTG AAGGAACTCA 1860 
1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 
1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 
1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 



FIG. 6C 

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220 
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280 
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340 
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400 
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520 
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580 
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640 
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG G AGGGGC TAG AACCTAGAGA AGCTAAAACC 2760 
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820 
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880 
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940 
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000 
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060 
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120 
3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180 
3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240 



FIG. 6D 

. . • 

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300 

3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360 
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540 
3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720 
3721 AAAAAAAAAA AAA 3733 



FIG. 7A 



1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 
61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 
121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 
181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 

241 CTCTCCCCAA ACATCTCCAC TAACACCTCG JAACCCAATC AGTTCGTGCA ACCAGCCTGG 300 
11LSPN 1ST NTS EPJNW f v 

301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT 360 

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA CGACAGTGAC GAACTATTTT 420 
51 V V V M WII L A H KRMK 

421 CTGG^AACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
71LVNL AFA EAS MAAF 

481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 

541 TTTCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTT^ATAGG 600 
111FPIA A V F ASI YSMT A v * 

601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 6i 6 5 O o 
131YMAI IHP LQP R L S A TAT 

661 ATCTGTGTCA TCTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720 
151 ICVI W V L ALL LA*f u ^ 

721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780 
171 TETM P'SR VVC MIEW ^ 

781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCT^ 840 
191IYEKVYHICVTVLIYt^ 

841 OT^TTGGCT ATGGATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900 
901 GACTCCTCTG ACCGC.ACCA CGAGCAAGTC TCTGCCAAGC GCAAGGXGG, CAAAA^TG 960 



FIG. 7B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGC CCT TCCACATCTT CTTCCTCCTG 1020 
251IVVV CTF AIC WLPF HIF F L L 270 

, • • 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 
271PYIN P D L Y L K KFIQ QVY 290 

1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 



291 



" M ""w LA MSS TMY NPII YCC L N D 310 



1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
R F R L G F K H A F R C C P Flo A G D JJU 

• * * 

TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CC 
YEGL EMK STR YLQT 

GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CC 
351VSRL ETT 1ST VVGA HEE EPE 370 



1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260 
331 YEGL EMK STR YLQT QGS 350 



1261 



1321 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371DGPK ATP SSL DLTS NCS SRS 390 

1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391DSKT MTE SFS FSSN VLS « 

1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 

1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 

1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 

1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 



1681 AGGTCGGACC 



AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766 



FIG. 8A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 
61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 
121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 
181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 

241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300 
ULSPN 1ST NTS EPNQ FVQ 

301 CAAATTGTCC rTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360 

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420 
51VVVM WII L A H K R M. k 

421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
71LVNL A F A EAS M A A * jmx 

481 ACCTATGCTG TCCACAACGA ATGGTACTAC G^CCTGTTCT ACTGCAAGTT CCACAACTTC 540 

541 TTCCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 
111 F P I A A V F ASI YSMT 

601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 
131 Y M A I IHP L Q P R I» S A 

661 ATCTGTGTCA TCTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720 
151ICVI WVL ALL LAFP Q x 

™ ™- rrr rrr: -rrf ?rr hi 

781 ATTTATGftGA AA^O* CATCTQTOTG AOTOTGCTOA TCTACTTCCT OCCCCTOCri 840 
841 CTGATTGGCT ATOC A.ACAC ATCACACTAJ COOCCAC^ ^CCC^ 800 

,01 SACTCCTCTO ACC™ CGAGCAAGTC TC^CAAOC CCA— CAAAATQATO 880 



FIG. 8B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 
251IVVV CTF AIC W L P F HIF FLL 270 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 
271PYIN P D L Y L K KFIQ QVY LAI 290 

1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 
291MWLA MSS TMY N P I I YCC L N D 310 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311 R F R L GFK HAF RCCP FIS AGD 330 

1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260 
331YEGL EMK STR Y L Q T QGS VYK 350 

1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320 
351VSRL ETT 1ST VVGA HEE EPE 370 

1321 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371DGPK ATP SSL DLTS NCS SRS 390 

1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391DSKT MTE SFS FSSN VLS * 407 

1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 

• * * 

1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 
1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 
1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 
1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766 



FIG. 9A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 

61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 

121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 

181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
^ MDNVLiPVU^a*' aw 

241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300 
11LSPN 1ST NTS EPNQ FVQ 



301 



CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360 



31QIVL WAA AYT VIVV 



361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420 
VVVM WII L A H KRMR T V T 

. . • 

CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGC1 
LVNL AFA EAS MA 

.w* ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTC- 

91 T Y A V H N E W Y Y G L F Y C K F * 1J 



421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
71 LVNL AFA EAS MAAF NTV 90 



481 



ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 



541 TTTCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 
111 F P I A A V F ASI YSMT AVA FDR 1JO 

601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 
131 Y M A I IHP LQP RLSA TAT KVV 

661 ATCTGTGTCA TATGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720 
151 I C V I WVL ALL LAFP QGY 

721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780 
171TETM PSR VVC MIEW 

781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 
191IYEK VYH ICV TVLI YFL 

841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900 
211VIGY AYT VVG ITLW ASE IPG 230 

901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960 
231DSSD RYH EQV SAKR KVV 250 



FIG. 9B 



961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 
251IVVV CTF AIC WLPF HIF F L L 270 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 
271PYIN P D L YLK KFIQ QVY LAI 290 

1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 
291MWLA MSS TMY NPII YCC 310 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311RFRL GFK HAF RCCP FIS AGO 330 

1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260 
331YEGL EMK STR YLQT QGS VYK 350 

1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320 
351VSRL ETT 1ST VVGA HEE 370 

1321 GACGGCCCCA AGGCCACACC CTCGTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371DGPK ATP SSL DLTS NCS SRS JSU 

1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391 DSKT MTE SFS FSSN VLS W 

1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 

1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 

1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 

1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 

1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766 



FIG. 10A 



1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 
61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 
121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG CTGCAGAGGG GGGTTGTGTA 180 

MDNV LPV DSD 10 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGAC TC AGAC 240 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCXGG 300 
11LSPN 1ST NTS EPWW r v w 

301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360 
31 Q I V L W A A AYT VIVV TSV 

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA OGACAGTGAC GAACTA^TT 420 
51 V V V M WII LAH KRMR l v j. 

421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
71LVNL AFA EAS MAAF 

481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCC TC TTCT ACTGCAAGTT CCACAACTTC 540 

541 T^TCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 

601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 
131 Y M A I IHP L Q P RLSATAi 

661 ATCTGTGTCA TCTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720 
151 I C V I W V L ALL LAFP QGY 

721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780 
171TETM PSR VVC M I E W F * n 

781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 

841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900 
211 V I G Y AYT VVG ITLW ASF. 

901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960 
231 DSSD RYH EQV SAKK * 



FIG. 10B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 
251IVVV CTF AIC W L P F HIF F L L 270 

. ■ • 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 
271PYIN PDL Y L K KFIQ QVY LAI 290 

1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 
291 M W L A MSS TMY NPII YCC LND 310 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311RFRL GFK HAF RCCP FIS AGD 330 

1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260 
331YEGL EMK STR YLQT QGS VYK 350 

• • * 

1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320 
351VSRL ETT 1ST VVGA HEE EPE 370 

1321 GACGGCCCCA AGGCCACACC CTCATCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371DGPK ATP SSL DLTS NCS SRS 390 

1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391DSKT MTE SFS FSSN VLS " 407 

. • • 

1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 
1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 
1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 
1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 
1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766 



FIG. 11A 



1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR L T L L T L L L L L LAG D RA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP ESL QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI S K M L FVE P I L 60 

241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300 
61 E V S S LPT TNS TTNS ATK ITA 80 



301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360 
81NTTD E P T TQP TTEP TTQ PTI 100 



361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420 
101QPTQ PTT Q L. P TDSP TQP TTG 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
121 S F C P GPV TLC SDLE SHS TEA 140 



481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 
141VLGD ALV DPS LKLY HAF SAM 160 



541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181 V L Li G AGQ NTK TNLE S I L SYP 200 

♦ 

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720 
201KDFT CVH QAL KGFT TKG VTS 220 

721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 
221VSQI FHS PDL AIRD TFV N A S 240 



781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
241RTLY SSS PRV LSNN SDA NLE 260 

841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 



, . • 

901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281LPSD TRL VLL NAIY L S A KWK 300 



FIG. 11B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG 
301 TTFD PKK TRM 

• • • 

1021 AAAGTGC CCA TGATGAATAG CAAGAAGTAC 
321KVPM MNS KKY 



1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC 
341KAKV GQL QLS 



1141 CAGAACCTGA AACATCGTCT TGAAGACATG 
361 QNLK HRL EDM 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG 
381 AIME KLE MSK 

# . • 

1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC 
401 IKVT TSQ DML 

. * * 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG 
421 FSYD LNL CGL 

1381 ATGCAGCACC AGACAGTGCT GGAACTGACA 
441 MQHQ TVL ELT 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC 
461 AISV ART LLV 

1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC 
481LWDQ QHK FPV 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC 

1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC 

. • • 

1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA 

1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT 



GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020 
EPFH FKN SVI 320 



, • • 

CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
PVAH FID QTL 340 



• ■ ■ 

CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
HNLS LVI LVP 360 



. • • 

GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
EQAL SPS VFK 380 



• • * 

TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
FQPT LLT LPR 400 

TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 
SIME KLE FFD 420 



. . * 

ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 
TEDP DLQ VSA 440 

GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440 
ETGV E A A A A S 460 



TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 
FEVQ QPF LFV 480 

TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560 
FMGR VYD PRA 500 

. • • 

GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 

CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 

AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 



GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 



FIG. 12A 



1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR LTL LTL L L L L LAG DRA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP ESL QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI SKML FVE PIL 60 



241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300 
61 E V S S LPT TNS TTNS ATK ITA 80 



301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360 
81NTTD EPT TQP TTEP TTQ PTI 100 



361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420 
101QPTQ PTT QLP TDSP TQP TTG 120 



421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
121SFCP GPV TLC SDLE SHS TEA 140 



481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 
141VLGD ALV DFS LKLY HAF SAM 160 



541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181VLLG A G Q N T K TNLE SIL SYP 200 



661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720 
201KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 
221VSQI FHS PDL AIRD TFV NAS 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
241RTLY SSS PRV LSNN SDA NLE 260 



841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 



901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281LPSD TRL VLL NAIY LSA KWK 300 



FIG. 12B 

961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CKTCRATA 1020 
301TTFD PKK TRM EPFH FKN SVJ. 

1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTCTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321KVPM MNS KKY PVAH 

1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341KAKV GQL QLS HNLS L V I L. V P 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 



361QNLK HRL EDM EQA 



L S P S V F K 380 



1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
381AIME KLE MSK FQPT 

1261 ATCAAAGTGA CGACCAGTCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 
401IKVT TSQ DML SIME KLE 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 
421FSYD L N Ij CGL TED P DLQ VSA 

1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG ^GAGGCGGC TGCAGCCTCC 1440 
441MQHQ TVL ELT ETGV 



1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 
461 AISV ART L.LV FE 



VQ QPF Ij F V 480 



1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC ^CATGGGGC GAGTATATGA CCCCAGGGCC 1560 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 
1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 
1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 
1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 



FIG. 13A 



1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR LTL LTL LLLL LAG DRA 20 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP E S L QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGCTGA ACCCATCCTG 240 
41GEGK VAT TVI S K M L F A E PIL 60 

241 GAGGTTTC C A GCTTGCCGAC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300 
61EVSS LPT TNS TTNS ATK ITA 80 

301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360 
81 N T T D EPT TQP TTEP TTQ PTI 100 

361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420 
101QPTQ PTT QLP TDS P TQP 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
121SFCP GPV TLC SDLE SHS TEA 140 

481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 
141VLGD ALV DFS LKLY HAF SAM 160 

541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI ASL LTQ 

601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181VLLG AGQ NTK TNLE SIL SYP 200 

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720 
201KDFT CVH QAL KGFT TKG VTS 220 

721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 
221VSQI FHS PDL AIRD TFV 240 

. • • 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
241RTLY SSS PRV LSNN SDA NLE 260 

841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL 280 

901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281LPSD TRL VLL NAIY LSA KWK 300 



FIG. 13B 

961 ACAACATTTG ATCCCAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020 
301TTFD PKK TRM EPFH FKN SVI 320 

1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321KVPM MNS KKY PVAH FID QTL 340 

1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
341KAKV GQL QLS HNLS L V I LVP 360 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
361QNLK HRL EDM EQAL SPS VFK 380 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
381AIME KLE MSK FQPT L L T LPR 400 

1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 
401IKVT TSQ DML SIME KLE FFD 420 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 
421FSYD L N L CGL TEDP DLQ VSA 440 

1381 ATGCAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440 
441MQHQ TVL ELT ETGV EAA AAS 460 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 
461AISV ART LLV FEVQ QPF L F V 480 

1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560 
481 Li W D Q QHK FPV FMGR VYD PRA 500 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 

1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 

1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 

• * * 

1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 



FIG. 14A 



1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA 



61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC 
1MASR LTL LTL 

. - • 

121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC 
21SSNP NAT SSS 



181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC 
41GEGK VAT TVI 



241 GAGGTTTCCA GCTTGCCGAC AACCAACTCA 
61EVSS LPT TNS 



301 AATACCACTG ATGAACCCAC CACACAACCC 
81NTTD EPT TQP 

361 CAACCCACCC AACCAACTAC CCAGCTCCCA 
101 QPTQ PTT QLP 



421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC 
121 SFCP GPV TL.C 

• • • 
481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC 
141 V L G D A L V D F S 

541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC 
161 KKVE TNM AFS 

601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA 
181 V L L G AGQ NTK 

• • • 
661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG 
201 KDFT CVH QAL 

721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG 
221 VSQI FHS PDL 

• 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC 
241 RTLY SSS PRV 

• • • 
841 CTCATCAACA CCTGGGTGGC CAAGAACACC 
261 LINT WVA KNT 



• • • 

AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 



CTGCTGCTGC TGCTGGCTGG GG AT AGAGC C 120 
L.LLL LAG DRA 20 

. • • 

TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
SQDP ESL QDR 40 



TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 

SKML FVE PIL 60 

• • • 
ACAACCAATT CAGCCACCAA AATAACAGCT 300 
TTNS ATK ITA 80 

• • • 
ACCACAGAGC CCACCACCCA ACCCACCATC 360 
TTEP TTQ PTI 100 

ACAGATTCTC CTACCCAGCC CACTACTGGG 420 

TDSP TQP TTG 120 



TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
SDLE SHS TEA 140 



CTGAAGCTCT ACCACGCCTT CTCAGGAATG 540 
LKLY HAF SGM 160 

CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
PFSI ASL LTQ 180 



• • • 

AC AAACC TGG AGAGCATCCT CTCTTACCCC 660 

TNLE SIL SYP 200 

• « * 
AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720 
KGFT TKG VTS 220 



GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 
AIRD TFV NAS 240 



• • • 

CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
LSNN SDA NLE 260 



AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
NNKI SRL LDS 280 



901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281 LPSD TRL VLL NAIY LSA KWK 300 



FIG. 14B 



961 ACAACATTTG ATCCCAAGAA AACCAGAATG 
301 TTFD PKK TRM 



1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC 
321 KVPM MNS KKY 

• • • 
1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC 

341 KAKV GQL QLS 

• • • 
1141 CAGAACCTGA AACATCGTCT TGAAGACATG 

361 QNLK HRL EDM 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG 
381 AIME KLE MSK 

• • • 
1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC 

401 IKVT TSQ DML 

• • • 
1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG 

421 FSYD LNL CGL 

1381 ATGCAGCACC AGACAGTGCT GGAACTGACA 
441 MQHQ TVL ELT 

» • • 

1441 GCCATCTCTG TGGCCCGCAC CCTGCTGGTC 
461 AISV A RT LLV 

1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC 
481 LWDQ QHK FPV 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC 

♦ 

1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC 



1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA 
1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT 



GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020 
E PFH FKN SVI 320 



CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
PVAH FID QTL 340 



* 

CACAATCTGA GTTTGGTGAT CCTGGTACCC 1140 
HNLS L V I LVP 360 



GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
EQAL SPS VFK 380 



TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
FQPT LLT LPR 400 

TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 
SIME KLE FFD 420 

ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 
TEDP DLQ VSA 440 

GAGACTGGGG TGGAGGCGGC TGCAGCCTCC 1440 
ETGV E A A A A S 460 

• * • 

TTTGAAGTGC AGCAGCCCTT CCTCTTCGTG 1500 
FEVQ QPF LFV 480 



TTCATGGGGC GAGTATATGA CCCCAGGGCC 1560 
FMGR VYD PRA 500 



GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 



CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 



AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 

• • • 

GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 



FIG. 15A 



1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AGTTTGGAGT CCGCTGACGT CGCCGCCCAG 60 

61 ATGGCCTCCA GGCTGACCCT GCTGACCCTC CTGCTGCTGC TGCTGGCTGG GGATAGAGCC 120 
1MASR LTL LTL L L L L LAG DRA 20 

• ••••* 
121 TCCTCAAATC CAAATGCTAC CAGCTCCAGC TCCCAGGATC CAGAGAGTTT GCAAGACAGA 180 
21SSNP NAT SSS SQDP E S L QDR 40 

181 GGCGAAGGGA AGGTCGCAAC AACAGTTATC TCCAAGATGC TATTCGTTGA ACCCATCCTG 240 
41GEGK VAT TVI S K M L FVE PIL 60 

241 GAGGTTTCCA GC TTGC C G AC AACCAACTCA ACAACCAATT CAGCCACCAA AATAACAGCT 300 
61EVSS LPT TNS TTNS ATK ITA 80 



301 AATACCACTG ATGAACCCAC CACACAACCC ACCACAGAGC CCACCACCCA ACCCACCATC 360 
81NTTD EPT TQP TTEP TTQ PTI 100 



361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGATTCTC CTACCCAGCC CACTACTGGG 420 
101QPTQ PTT QLP TDSP TQP TTG 120 

421 TCCTTCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 
121 SFCP GPV TLC SDLE SHS TEA 140 

481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGCAATG 540 
141VLGD ALV DFS LKLY HAF SAM 160 



541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI ASL LTQ 180 



601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181VLLG AGQ NTK TNLE SIL SYP 200 



661 AAGGACTTCA CCTGTGTCCA CCAGGCCCTG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720 
201KDFT CVH QAL KGFT TKG VTS 220 



721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 
221VSQI FHS PDL AIRD TFV NAS 240 

781 CGGACCCTGT ACAGCAGCAG CCCCAGAGTC CTAAGCAACA ACAGTGACGC CAACTTGGAG 840 
241RTLY SSS PRV LSNN SDA N L E 260 

841 CTCATCAACA CCTGGGTGGC CAAGAACACC AACAACAAGA TCAGCCGGCT GCTAGACAGT 900 
261 LINT WVA KNT NNKI SRL LDS 280 

901 CTGCCCTCCG ATACCCGCCT TGTCCTCCTC AATGCTATCT ACCTGAGTGC CAAGTGGAAG 960 
281LPSD TRL VLL NAIY LSA KWK 300 



FIG. 15B 

961 ACAACATTTG ATC C CAAGAA AACCAGAATG GAACCCTTTC ACTTCAAAAA CTCAGTTATA 1020 

1021 AAAGTGCCCA TGATGAATAG CAAGAAGTAC CCTGTGGCCC ATTTCATTGA CCAAACTTTG 1080 
321KVPM MNS KKY PVAH Q 1 

1081 AAAGCCAAGG TGGGGCAGCT GCAGCTCTCC CACAATCTGA GTTTGG^AT CCTCGTACCC 1140 
341KAKV GQL QLS HNLS LVX 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCOCPTC TGTTTTCAAG 1200 
361QNLK HRL EDM EQAL SPS 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG ™CAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
381 AIMEKLEMSKFUf^ 

1261 ATCAAAGTGA CGACCAGCCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTC6AT 1320 
401IKVT TSQ DML SIME 

1321 TTTTCTTATG ACCTTAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 
421FSYD LNL CGL T E D P 

1381 A^CAGCACC AGACAGTGCT GGAACTGACA GAGACTGGGG TGGAGGCGGC ^CAGCCTCC 1440 
441 MQHQ TVL E L T * * « 

1*41 GCCATCTCTG ICOCC**^ CCTOCTC^ AGC^CCCri CCTCTTOXTi 1500 

461AISVARTLLVFEVUU 

1501 CTCTGGGACC AGCAGCACAA GTTCCCTGTC TTCATGGGGC GAGTATATGA CCCCAGGGCC 1360 
481LWDQ QHK FPV FMGR V 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 
1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 
1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 
1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTrr GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



1801 TCTATAAATA AAACCTGACA GACCAT 1826 



FIG. 16 

1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
x MWFLVLCL8 

61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120 
9ALSL GOT GAA PPIQ SRI VGG 28 

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 
29WECE QHS QPW QAAL YHF S T F 48 

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 
49Q CGG I L V HRQ WVLT AAH CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300 
69DNYQ LWL GRH NLFD DEN TAQ 88 

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360 
89FVHV SES FPH PGFN MSL L EN 108 

361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 
109HTRQ ADE DYS HDLM LLR 128 

421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCGA GGAACCCGAA 480 
129PADT ITD AVK VVEL PTE EPE 148 

481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 
149VGST CLA SGW GSIE PEN lbS 

541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CAAAAAAGCC 600 
169PDDL QCV DLK ILPN DEC 188 

601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189HVQK VTD FML CVGH LEG 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720 
209 TCVG DSG GPL MCDG VLQ GVT 

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780 
229 SWGY VPC GTP NKPS VAV 248 

781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS ^ 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 



FIG. 17 



1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 

MWFLVLCL8 



* ' * 

61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120 

9ALSL GGT G A A PPIQ SRI VGG 28 

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 

29WECE QHS QP W QAAL YHF STF 48 



181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 
49QCGG ILV HRQ WVLT AAH CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300 
69 D N Y Q LWL GRH N L F D DEN TAQ 88 

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360 
89FVHV SES FPH PGFN MSL LEN 108 



361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 
109HTRQ ADE DYS HDLM LLR LTE 128 



421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCGA GGAACCCGAA 480 
129PADT ITD A V K VVEL PTE EPE 148 



481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 
149VGST CLA SGW GSIE PEN FSF 168 

541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CGAAAAAGCC 600 
169PDDL QCV DLK ILPN DEC EKA 188 

601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189HVQK VTD FML CVGH LEG GKD 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720 
209 TCVG DSG GPL MCDG VLQ GVT 228 

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780 
229 SWGY VPC GTP NKPS VAV RVL 248 

781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS * 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 



FIG. 18 

1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
x MW FLV LCL 8 

• • * 

61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120 
9 A L S L GGT GAA PPIQ SRI VGG 28 

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 
29 WE C E QHS QPW Q A A L YHF STF 48 

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 
49QCGG I L V HRQ W V L T AAH CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300 
69DNYQ L W L GRH N L F D DEN TAQ 88 

• 

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360 
89FVHV SES FPH PGFN M S L L E N 108 

361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 
109HTRQ ADE DYS HDLM LLR LTE 128 

421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCCA GGAACCCGAA 480 
129PADT ITD A V K VVEL PTg EPE 148 

481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 
149VGST CLA SGW GSIE PEN FSF 168 

541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CAAAAAAGCC 600 
169PDDL QCV DLK ILPN DEC KKA 188 

601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189HVQK VTD FML CVGH LEG GKD 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720 
209 TCVG DSG GPL MCDG VLQ GVT 228 

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780 
229 SWGY VPC GTP NKPS VAV RVL 248 

• * * 

781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS * 262 

841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 
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1 CTGTGCATGG CATCATCCTG GCCCCCTCTA 
1 MASSWPPL 

• • • 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT 
19FPQN ATA CDN 

121 GTGCTGCCNA CATTTATCAT CTCCATCTGT 
39VLPT FII SIC 

• 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA 
59LLVF LLP RRQ 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG 
79LAAS DLV FVL 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC 
99NQFN WPF GAL 

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG 
119 ANLF ISI FLV 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG 
139 LVHP MAS GRQ 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG 
159 VLIW VVG GLL 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC 
179 QAVP DLN ITA 

• • • 
601 CACTTTGCAA GGATTGTGGA GTTAAATATT 
199 HFAR IVE LNI 

• • • 
661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC 
219 VFFN YHI LAS 

721 AGAGTGCNGG GGCCGAAGGA TAGCAAGACC 
239 RVXG PKD SKT 



781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC 
259 FLVC WAP YHF 

, • • 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC 
279 QAVR GCF WED 



• • * 

GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
ELQS S N Q SQL 18 

GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
APEA WDL LHR 38 

TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
FFGL LGN LFV 58 

CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
LNVA EIY LAN 78 



GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
GLPF WAE NIW 98 



• • • 

CTCTGCCGTG TCATCAANGG GGTCATCAAG 360 
LCRV ING VIK 11S 



GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
VAIS QDR YRV 138 

CAGCGGCGGA GNCAGGCCCG GGTCACCTGC 480 
QRRR QAR VTC 158 

• • • 

AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
SIPT FLL RSI 178 

TGCATCNTGC TCCTCCCCCA TGAGGCCTGG 600 
CIXL L PH EAW 198 



• • • 
CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
LGFL LPL A A I 218 

CTGCGAACGC GGGANGAGGT CAGCAGGACA 720 

LRTR EEV SRT 238 

• • • 
ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
TALI LTL VVA 258 



TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
FAFL EFL FQV 278 



■ • • 

TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
FIDL GLQ LAN 298 



FIG. 21B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT 
299 FFAF TNS SLN 

• 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA 
319 FRTK VWE LYK 

• • • 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA 
339 SSSH RKE IFQ 



CCAGTAATTT ATGTCTTTGT GGGCCNGCTC 960 
PVIY VFV GXL 318 



CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
QCTP KSL API 338 

CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
L F W R N * 353 



1081 CC 1082 



FIG. 22A 

1 AATTCAGAGC CACCGCGGGC AGGCGGGCAG TGCATCCAGA AGCGTTTATA TTCTGAGCGC 60 



61 CAGTTCAGCT TTCAAAAAGA GTGCTGCCCA TAAAAAGCCT TCCACCCTCC TGTCTGCTTT 120 



121 AGAAGGACCC TGAGCCCCAG GCGCCAGCCA CAGGACTCTG OTGCAGAGGG GGGTTGTGTA 180 



181 CAGATAGTAG GCTTTACGCC TAGCTTCGAA ATGGATAACG TCCTCCCGGT GGACTCAGAC 240 
1 MDNVLPVDSD10 



241 CTCTCCCCAA ACATCTCCAC TAACACCTCG GAACCCAATC AGTTCGTGCA ACCAGCCTGG 300 
11LSPN 1ST NTS EPNQ FVQ PAW 30 



301 CAAATTGTCC TTTGGGCAGC TGCCTACACG GTCATTGTGG TGACCTCTGT GGTGGGCAAC 360 
31 Q I V L WAA AYT VIVV TSV VGN 50 

361 GTGGTAGTGA TGTGGATCAT CTTAGCCCAC AAAAGAATGA GGACAGTGAC GAACTATTTT 420 
51 V V V M WII L A H KRMR TVT NYF 70 



421 CTGGTGAACC TGGCCTTCGC GGAGGCCTCC ATGGCTGCAT TCAATACAGT GGTGAACTTC 480 
71 L V N L AFA EAS MAAF NTV VNF 90 



481 ACCTATGCTG TCCACAACGA ATGGTACTAC GGCCTGTTCT ACTGCAAGTT CCACAACTTC 540 

91TYAV HNE WYY GLFY CKF HNF 110 

541 TTNCCCATCG CCGCTGTCTT CGCCAGTATC TACTCCATGA CGGCTGTGGC CTTTGATAGG 600 

111 F P I A A V F ASI YSMT AVA FDR 130 

601 TACATGGCCA TCATACATCC CCTCCAGCCC CGGCTGTCAG CCACAGCCAC CAAAGTGGTC 660 

131 Y M A I IHP L Q P R L S A TAT KVV 150 

661 ATCTGTGTCA TNTGGGTCCT GGCTCTCCTG CTGGCCTTCC CCCAGGGCTA CTACTCAACC 720 

151 I C V I WVL ALL LAFP QGY YST 170 



721 ACAGAGACCA TGCCCAGCAG AGTCGTGTGC ATGATCGAAT GGCCAGAGCA TCCGAACAAG 780 
171 TETM PSR VVC MIEW PEH PNK 190 

781 ATTTATGAGA AAGTGTACCA CATCTGTGTG ACTGTGCTGA TCTACTTCCT CCCCCTGCTG 840 
191 IYEK VYH ICV TVLI Y FL PLL 210 

841 GTGATTGGCT ATGCATACAC CGTAGTGGGA ATCACACTAT GGGCCAGTGA GATCCCCGGG 900 
211 VIGY AYT VVG ITLW ASE IPG 230 

901 GACTCCTCTG ACCGCTACCA CGAGCAAGTC TCTGCCAAGC GCAAGGTGGT CAAAATGATG 960 
231DSSD RYH EQV SAKR KVV KMM 250 



331 Y E G L 



FIG. 22B 

961 ATTGTCGTGG TGTGCACCTT CGCCATCTGC TGGCTGCCCT TCCACATCTT CTTCCTCCTG 1020 
251IVVV CTF AIC W L P F HIF F L L 270 

1021 CCCTACATCA ACCCAGATCT CTACCTGAAG AAGTTTATCC AGCAGGTCTA CCTGGCCATC 1080 
271PYIN P D L Y L K KFIQ QVY LAI 290 

1081 ATGTGGCTGG CCATGAGCTC CACCATGTAC AACCCCATCA TCTACTGCTG CCTCAATGAC 1140 
291 M W L A MSS TMY NP1I YCC 

1141 AGGTTCCGTC TGGGCTTCAA GCATGCCTTC CGGTGCTGCC CCTTCATCAG CGCCGGCGAC 1200 
311RFRL GFK HAF RCCP FIS 330 

1201 TATGAGGGGC TGGAAATGAA ATCCACCCGG TATCTCCAGA CCCAGGGCAG TGTGTACAAA 1260 
E M K S T R Y 1> Q T Q G S 

1261 GTCAGCCGCC TGGAGACCAC CATCTCCACA GTGGTGGGGG CCCACGAGGA GGAGCCAGAG 1320 
351VSRL ETT 1ST V VGA HEE 

1321 GACGGCCCCA AGGCCACACC CTCHTCCCTG GACCTGACCT CCAACTGCTC TTCACGAAGT 1380 
371DGPK ATP SSL DLTS NCS SRS J»U 

1381 GACTCCAAGA CCATGACAGA GAGCTTCAGC TTCTCCTCCA ATGTGCTCTC CTAGGCCACA 1440 
391DSKT MTE SFS FSSNVLS 

. • • 

1441 GGGCCTTTGG CAGGTGCAGC CCCCACTGCC TTTGACCTGC CTCCCTTCAT GCATGGAAAT 1500 
1501 TCCCTTCATC TGGAACCATC AGAAACACCC TCACACTGGG ACTTGCAAAA AGGGTCAGTA 1560 
1561 TGGGTTAGGG AAAACATTCC ATCCTTGAGT CAAAAAATCT CAATTCTTCC CTATCTTTGC 1620 
1621 CACCCTCATG CTGTGTGACT CAAACCAAAT CACTGAACTT TGCTGAGCCT GTAAAATAAA 1680 
1681 AGGTCGGACC AGCTTTTCCT CAAGAGCCCA ATGCATTCCA TTTCTGGAAG TGACTTTGGC 1740 



1741 TGCATGCGAG TGCTCATTTC AGGATG 1766 



FIG. 23A 



1 AGTCTGCACT GGAGCTGCCT GGTGACCAGA AOTTTGGAGT COGCTGACGT CGCCGCCCAG 60 
„ ATGGCCTCCA GGCTGACCCT GCTGAOCCTC ^CTOCTOC GGATAGAGCC g. 

11 rrrj ttt ttt 0 rrri ~t rrr s° 

G3CGAAGGGA AGO™ AACAGTTATC TCCAAGATGC TATTCGtfTGA ACCGATGCTG JJ. 

GAGGTTTCCA GC™= AACCAACTCA ACAACCAAT, CAGCCACCAA AAT AACAGCT 300 

30! AATACCACTG ATGAACCCAC CACACAACCC ACCAOAGAGC CCACCACCCA AOCGACCATC 360 
81NTTD EPT TQP TTEP T 1 U 

361 CAACCCACCC AACCAACTAC CCAGCTCCCA ACAGA^CTC CTACCCAGCC CACTACTGGG 420 
101QPTQ PTT QLP TDSP T«f 

421 TCC^PCTGCC CAGGACCTGT TACTCTCTGC TCTGACTTGG AGAGTCATTC AACAGAGGCC 480 

481 GTGTTGGGGG ATGCTTTGGT AGATTTCTCC CTGAAGCTCT ACCACGCCTT CTCAGHAATG 540 
141 V L G D A L V D F S L K L Y & 

541 AAGAAGGTGG AGACCAACAT GGCCTTTTCC CCATTCAGCA TCGCCAGCCT CCTTACCCAG 600 
161KKVE TNM AFS PFSI 

601 GTCCTGCTCG GGGCTGGGCA GAACACCAAA ACAAACCTGG AGAGCATCCT CTCTTACCCC 660 
181VLLG AGQ NTK TNLE Sin 

661 AAGGACTTCA CCTGTGTCCA CCAGGCCCOG AAGGGCTTCA CGACCAAAGG TGTCACCTCA 720 
201KDFT CVH Q A L KGFT l * v, 

721 GTCTCTCAGA TCTTCCACAG CCCAGACCTG GCCATAAGGG ACACCTTTGT GAATGCCTCT 780 
221VSQI FHS PDL AIRD Ttv 

£ OGGACCCTGT AGAGCAGCAG CCCCAGAGTC ^AAGCAACA ACAGTGACGG CAAC^GAG 040 

ss rrri cc rrr ttt < rrrs ~rs-r °rrr as 
is' rrrs -tst ttt° rr?^ sc rr? rrr ss 



FIG. 23B 

» oi ttt ?frs ic rrr °rrr »s° 

*s rrrs "rrr ttt rr?i -rr? ttt 0 x. m 

10.1 AAAGCCAAGG TCGGGCAGCT GCAGCTCTCC C^CMTCTGA QTTTGGTGAT CCTGGTACCC 1140 
341KAKVGQLQL SHNlj£>AJ 

1141 CAGAACCTGA AACATCGTCT TGAAGACATG GAACAGGCTC TCAGCCCTTC TGTTTTCAAG 1200 
361QNLK HRL EDM EQAL SPS 

1201 GCCATCATGG AGAAACTGGA GATGTCCAAG TTCCAGCCCA CTCTCCTAAC ACTACCCCGC 1260 
381 AIME KLE MSK FQPA 

1261 ATCAAAGTGA CGACCAGNCA GGATATGCTC TCAATCATGG AGAAATTGGA ATTCTTCGAT 1320 
401 IKVT TSQ DML SIME K Ii * 

1321 TTTTCTTATG ACC^TAACCT GTGTGGGCTG ACAGAGGACC CAGATCTTCA GGTTTCTGCG 1380 
421 FSYD LNL CGL. TEDP DLQ V o 

1381 ATGCAGCACC AGACAGTGCT GGAACTGACA OAGACTGGGG TOOAOOCGOC TGCAGCCTCC 1440 
441MQHQ TVL ELT ETGV 

Xjg GCC™ TGGCCCGCAC CC^C JTTGAAGTGC AGCAGCCCJ, C^TG 1500 

1501 CTC^GGACC AGCAGCACAA G^CC^TC ^TCATGGGGC GAGTATATGA CCCCAGGGCC 1560 

1561 TGAGACCTGC AGGATCAGGT TAGGGCGAGC GCTACCTCTC CAGCCTCAGC TCTCAGTTGC 1620 
1621 AGCCCTGCTG CTGCCTGCCT GGACTTGCCC CTGCCACCTC CTGCCTCAGG TGTCCGCTAT 1680 
1681 CCACCAAAAG GGCTCCTGAG GGTCTGGGCA AGGGACCTGC TTCTATTAGC CCTTCTCCAT 1740 
1741 GGCCCTGCCA TGCTCTCCAA ACCACTTTTT GCAGCTTTCT CTAGTTCAAG TTCACCAGAC 1800 



* 

1801 TCTATAAATA AAACCTGACA GACCAT 1826 



FIG. 24 

* • * 

1 TCCTCCACCT GCTGGCCCCT GGACACCTCT GTCACCATGT GGTTCCTGGT TCTGTGCCTC 60 
x MWFLVLCL8 

61 GCCCTGTCCC TGGGGGGGAC TGGTGCTGCG CCCCCGATTC AGTCCCGGAT TGTGGGAGGC 120 
9ALSL GGT G A A PPIQ SRI VGG 28 

121 TGGGAGTGTG AGCAGCATTC CCAGCCCTGG CAGGCGGCTC TGTACCATTT CAGCACTTTC 180 
29WECE QHS QPW QAAL YHF STF 48 

m 

181 CAGTGTGGGG GCATCCTGGT GCACCGCCAG TGGGTGCTCA CAGCTGCTCA TTGCATCAGC 240 
49QCGG I L V HRQ WVLT AAH CIS 68 

241 GACAATTACC AGCTCTGGCT GGGTCGCCAC AACTTGTTTG ACGACGAAAA CACAGCCCAG 300 
69 D N Y Q LWL GRH N L F D DEN TAQ 88 

301 TTTGTTCATG TCAGTGAGAG CTTCCCACAC CCTGGCTTCA ACATGAGCCT CCTGGAGAAC 360 
89FVHV SES FPH PGFN M S L It E N 108 

361 CACACCCGCC AAGCAGACGA GGACTACAGC CACGACCTCA TGCTGCTCCG CCTGACAGAG 420 
109HTRQ ADE DYS HDLM LLR LTE 128 

. • • * 

421 CCTGCTGATA CCATCACAGA TGCTGTGAAG GTCGTGGAGT TGCCCACCNA GGAACCCGAA 480 
129PADT ITD A V K VVEL PTX EPE 148 

481 GTGGGGAGCA CCTGTTTGGC TTCCGGCTGG GGCAGCATCG AACCAGAGAA TTTCTCATTT 540 
149VGST C L A SGW GSIE PEN FSF 168 

• * * 

541 CCAGATGATC TCCAGTGTGT GGACCTCAAA ATCCTGCCTA ATGATGAGTG CMAAAAAGCC 600 
169PDDL QCV DLK ILPN DEC 3CKA 188 

• • * 

601 CACGTCCAGA AGGTGACAGA CTTCATGCTG TGTGTCGGAC ACCTGGAAGG TGGCAAAGAC 660 
189HVQK VTD FML CVGH LEG GKD 208 

661 ACCTGTGTGG GTGATTCAGG GGGCCCGCTG ATGTGTGATG GTGTGCTCCA AGGTGTCACA 720 
209 T CVG DSG GPL MCDG VLQ GVT 228 

721 TCATGGGGCT ACGTCCCTTG TGGCACCCCC AATAAGCCTT CTGTCGCCGT CAGAGTGCTG 780 
229 SWGY VPC GTP NKPS VAV RVL 248 

781 TCTTATGTGA AGTGGATCGA GGACACCATA GCGGAGAACT CCTGAACGCC CAGCCCTGTC 840 
249 SYVK WIE DTI AENS * 262 



841 CCCTACCCCC AGTAAAATCA AATGTGCATC C 871 



FIG. 25A 

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSWPPLELQSSNQSQL18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19FPQN ATA CDN A P E A W D L 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FII SIC F F G L L G N 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ L N V A EIY LAN 78 



241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
79LAAS DLV FVL GLPF W A E 98 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAATGG GGTCATCAAG 360 
99NQFN WPF GAL LCRV ING VIK 11B 



361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
119ANLF ISI FLV VAIS QDR 138 

■ 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139LVHP MAS GRQ Q R R R QAR VTC 158 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159VLIW VVG GLL SIPT FLL RSI 178 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 
179 Q A V P DLN ITA CILL LPH 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199HFAR IVE LNI LGFL LPL A 



661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219VFFN YHI LAS LRTR 



E V S R T 238 



721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 RVRG PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 Q.A VR GCF WED FIDL GLQ LAN 298 



FIG. 25B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT 
299 FFAF TNS SLN 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA 
319FRTK VWE LYK 

. . • 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA 
339 SSSH RKE IFQ 



CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
PVIY VFV GRL 318 



CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
QCTP KSL API 338 

CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
L F W R N * 353 



1081 CC 1082 



FIG. 26A 

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19FPQN ATA C D N APEA WDL LHR 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FII SIC FFGL LGN LFV 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP R R Q LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
79LAAS DLV FVL GLPF WAE NIW 98 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
99NQFN WPF GAL L CRV ING VIK 118 

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
119ANLF ISI FLV VAIS QDR YRV 138 

421 CTGGTGC AC C CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GACAGGCCCG GGTCACCTGC 480 
139LVHP MAS GRQ Q R R R QAR VTC 158 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159VLIW VVG GLL SIPT FLL RSI 178 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 
179 Q A V P D L N ITA CILL LPH EAW 198 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199HFAR IVE LNI LGFL LPL 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219VFFN YHI LAS LRTR EEV SRT 238 

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 RVRG PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 Q A V R GCF WED FIDL GLQ LAN 298 



FIG. 26B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT 
299 FFAF TNS SLN 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA 
319FRTK VWE LYK 

• • • 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA 
339 SSSH RKE IFQ 



CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
PVIY VFV GRL 318 



• • • 

CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
QCTP KSL API 338 

CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
L F W R N * 353 



1081 CC 1082 



FIG. 27A 

1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 M ASSWPPLELQSSNQSQL18 

. • • 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19FPQN ATA CDN APEA W D L L H R 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39 V L P T FII SIC FFGL L G N L F V 58 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59 L L V F LLP RRQ LNVA EIY LAN 78 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
79LAAS DLV FVL GLPF WAE 98 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
99NQFN WPF GAL LCRV ING VIK 1XB 



361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
119ANLF 1SI FLV VAIS QDR 138 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139 L V H P MAS G R Q Q R R R Q A R V T C 1S« 



481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159VLIW VVG GLL SIPT FLL RSI 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCGTGC TCCTCCCCCA TGAGGCCTGG 600 
179 Q A V P DLN ITA CIVL LPH 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199HFAR IVE LNI LGFL LPL 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219VFFN YHI LAS LRTR EEV 

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 RVRG PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 Q A V R GCF WED FIDL GLQ LAN 29» 



FIG. 27B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT 
299 FFAF TNS SLN 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA 
319 FRTK VWE LYK 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA 
339 SSSH RKE IFQ 



CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
PVIY VFV GRL 318 



CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
QCTP KSL API 338 

• • * 

CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
L F W R N * 353 



1081 CC 1082 , 



FIG. 28A 



1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL E L Q S SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19 F P Q N ATA CDN APEA WDL 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FIX SIC FFGL LGN 

181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59 L L V F LLP R R Q LNVA 

241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
79LAAS DLV FVL GLPF 

301 AACCAGTTTA ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
99 N Q F N W P F GAL L C R V x . .. . 

361 GCCAATTTGT TCATCAGCAT CrTCCTGGTG GTGGCCATCA ^CAGGACCG CTACCGCGT3 420 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA «=CA^CG GGTCACCTGC 480 
139LVHP MAS GRQ Q R R R QAR 

481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159 VLIW VVG GLL SIPT Fiii- „ 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199HFAR IVE LNI LGFL 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219VFFN YHI LAS LRTR KEV 

721 AGAGTGCGGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGT^CC 780 
239 RVRG PKD SKT TALI 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 Q A V R GCF WED FIDL GLQ LAN 



FIG. 28B 



901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
299 FFAF TNS SLN PVIY VFV GRL 318 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
319FRTK VWE LYK QCTP KSL API 338 

1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
339 SSSH RKE IFQ LFWR N * 353 



1081 CC 1082 



FIG. 29 A 



1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTT GTGAGGACTC CGTGCCCACC 60 
1MFSP WKI SMF LSVC EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120 
21TASF SAD MLN VTLQ GPT LNG 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
41 T F A Q SKC PQV EWLG WLN TIQ 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61PPFL WVL FVL ATLE NIF VLS 80 

. • • 

241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
81VFCL HKS SCT VAEI Y L G NLA 100 

301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360 
101 A A D L I L A CGL P F W A ITI SNN 120 

361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420 
121FDWL FGE TLC RVVN All SMN 140 

421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141LYSS ICF LML VSID RYL ALV 160 

481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
161KTMS MGR MRG VRWA KLY SLV 180 

. • • 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181IWGC TLL LSS PMLV FRT MKE 200 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201YSDE GHN VTA CVIS YPS LIW 220 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
221EVFT NML LNV VGFL LPL SVI 240 

721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780 
241TFCT M Q I MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
261EIQT ERR ATV LVLV VLL LFI 280 

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281ICWL PFQ 1ST F 1> D T LHR LGI 300 



FIG. 29B 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 
301LSSC QDE R I I DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080 
341KKSW EVY QGV CQKG GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACGTCCA TCTCCGTGGA ACGCCAGATT 1140 
361IQME NSM GTL RTSI SVE R Q I 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 
1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320 
1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 
1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 

m 

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 
1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 
1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 
1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

m 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 
1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 
1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 
1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 
1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 



FIG. 29C 



2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 



2101 GTACATGTGA 



2161 ACTGAGGTCT 



2221 AAGCACCAGT 



2281 CCACCCTGAG 



2341 AAAGTCTGAT 



2401 AC AGTGC TGA 



2461 TCAGGGACTG 



2521 GAAGGTGGCC 



2581 TCGGCAGTGC 



2641 GAGCTAGAAC 



GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160 



AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220 



GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280 



GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340 



TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400 



GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 



TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520 



CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580 



CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640 



CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700 



2701 GAGGCTAGAA CCAAGAAGGG CTAGAAC CTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760 



2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820 



2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880 



2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940 



2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000 



3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060 



3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120 



3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180 



3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240 



FIG. 29D 

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300 
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360 
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540 
3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720 
3721 AAAAAAAAAA AAA 3733 



FIG. 30A 



1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60 
1MFSP WKI SMF LSVR EDS VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120 
21TASF SAD MLN VTLQ GPT L N G 40 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
41TFAQ SKC PQV EWLG WLN TIQ 60 



181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61PPFL WVL FVL ATLE NIF VLS 80 

• ••••• 

241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
81VFCL HKS SCT VAEI YLG NLA 100 



301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360 
101 AADL I L A CGL PFWA ITI SNN 120 



361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420 
121 FDWL FGE TLC RVVN All SMN 140 



• • • 

421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141 LYSS ICF LML VSID RYL A L V 160 



481 AAAACCATGT CCATGGGCCG GATGC GCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
161 KTMS MGR MRG VRWA KLY SLV 180 

• ••••• 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181 IWGC TLL LSS PMLV FRT MKE 200 

• ••«•• 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201 YSDE GHN VTA CVIS YPS LIW 220 



• ••••• 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
221 EVFT N M Li LNV VGFL LPL SVI 240 

• ••••• 
721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780 
241 TFCT MQI MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
261 EIQT ERR A T V LVLV VLL LFI 280 

• ••••• 

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ 1ST FLDT LHR LGI 300 



FIG. 30B 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GACGTAATCA CACAGATCGC CTCCTTCATG 960 
301 LSSC QDE RII DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321 AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKG GCR SEP 360 

• • • • • 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361 IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GS R Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320 

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

• • • • * ■ 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 

1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 



FIG. 30C 

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 

2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160 

• ••••• 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220 

2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280 

2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340 

2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG AT&TTTCTAA TCGGTCTTGC CCAGAGGATC 2400 

2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 

2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520 

• • • • • • 
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580 

• ••••• 
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640 

2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700 

• ••••• 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760 

2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820 

2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880 

• ••••• 
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG. 2940 

• ••••• 
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000 

• »•••• 
3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060 

• ••••• 
3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120 

3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180 

• ••••• 
3181 GAGCC CATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240 



FIG. 30D 

• ••••• 

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300 

• ••*•• 
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360 

• «•••• 
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420 

• ••••• 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 

3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540 

3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 

• • • • • • 
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 

• ••••• 
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720 

3721 AAAAAAAAAA AAA 3733 



FIG. 31A 



1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT 
1MFSP WKI SMF 

• • • 
61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT 
21TASF SAD MLN 

• • • 
121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG 

41TFAQ SKC PQV 



181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG 
61PPFL WVL FVL 

• • • 
241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG 

81VFCL HKS SCT 

• • • 
301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG 
101 AADL I L A CGL 

• * • 
361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC 
121 FDWL FGE TLC 



• • • 
421 CTGTACAGCA GCATCTGTTT CCTGATGCTG 
141LYSS ICF L M L 

481 AAAACCATGT CCATGGGCCG GATGCGCGGC 
161 KTMS MGR MRG 

• • • 
541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA 
181 IWGC TLL LSS 



601 TACAGCGATG AGGGCCACAA CGTCACCGCT 
201 YSDE GHN VTA 



• • • 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC 
221 EVFT NML LNV 



• • • 
721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG 
241 TFCT MQI MQV 

• • • 
781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG 
261 E I Q T ERR A T V 

• • • 
841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC 
281 ICWL PFQ 1ST 



CTGTCTGTTC GTGAGGACTC CGTGCCCACC 60 
LSVR EDS VPT 20 

• • • 
GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120 
VTLQ GPT L N G 40 

GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
EWLG WLN TIQ 60 

GCCACCCTAG AGAAGATCTT TGTCCTCAGC 240 
ATLE NIF VLS 80 

• • • 
GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
VAEI YLG NLA 100 



• • • 

CCCTTCTGGG CCATCACCAT CTCCAACAAC 360 
PFWA ITI SNN 120 



CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420 
RVVN All SMN 140 



• • • 

GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
VSID RYL A L V 160 



• • • 

GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
VRWA KLY SLV 180 



> • • 

CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
PMLV FRT MKE 200 



TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
CVIS YPS LIW 220 



GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
VGFL LPL SVI 240 



CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780 
LRNN EMQ KFK 260 



• • • 
CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
LVLV VLL LFI 280 

• • • 
TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
FLDT LHR LGI 300 



FIG. 31B 



901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 
301 LSSC QDE RII DVIT Q I A SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321 AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG AGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKE GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361 IQME NSM GTL RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ D W A GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320 

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTC TTTAA 1620 

• • • • • • 
1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 

1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 

• ••••• 
1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 



FIG. 31C 

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220 
2221 AAGCACCAGT GTC TGGC AC A CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280 
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340 
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400 
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520 
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTC AC CA 2580 
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640 
2641 GAGCTAGAAC CTGGAGAGCT AGAAC CTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760 
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820 
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880 
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940 
2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000 

3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060 

• ••••• 

3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120 

3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180 

3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240 



FIG. 31D 



3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300 



3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360 



3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420 



3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTC TGAGAC TCTCTTAGGA 3480 



3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540 



3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 



3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 



3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720 



3721 AAAAAAAAAA AAA 3733 



FIG. 32A 



1 CGCCCAACCCAAGTTCAAAGGCTGATAAGAGAGAAAATCTCATGAGGAGGTTTTAGTCTA 6 0 



61 GGGAAAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGATGTCAAGCTCTTCCTG 120 
1 MSSSSW6 

121 GCTCCTTCTCAGCCTTGTTGCTGTAACTGCTGCTCAGTCCACCATTGAGGAACAGGCCAA 180 

7 L L L S LVAVTAAQSTI EEQAK 26 

• • • • • • 

181 GACATTTTTGGACAAGTTTAACCACGAAGCCGAAGACCTGTTCTATCAAAGTTCACTTGC 240 

27 TFLDKFNHEAEDLFYQSSLA 46 

• ••••• 

241 TTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATGAATAATGCTGG 300 

47 SWNYNTN I TEENVQNMNNAG 66 

• ••••• 

301 GGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACA 360 

67 DKWSAFLKEQSTLAQMYPLQ 86 

361 AGAAATTCAGAATCTCACAGTCAAGCTTCAGCTGCAG 420 

87 EIQNLTVKLQLQALQQNGSS 106 

• ••••• 

421 AGTGCTCTCAGAAGACAAGAGCAAACGGTTGAACACAATTCTAAATACAATGAGCACCAT 480 

107 VLSEDKSKRLNTILNTMSTI 126 

• ••••• 

481 CTACAGTACTGGAAAAGTTTGTAACCCAGATAATCCACAAGAATGCTTATTACTTGAACC 540 

127 YSTGKVCNPDNPQECLLLEP 146 

• •»••• 

541 AGGTTTGAATGAAATAATGGCAAACAGTTTAGACTACAATGAGAGGCTCTGGGCTTGGGA 600 

147 GLNEIMANSLDYNERLWAWE 166 

• *•••• 

601 AAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTT 660 

167 SWRSEVGKQLRPLYEEYVVL 186 

661 GAAAAATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGA 720 

187 KNEMARANHYEDYGDYWRGD 206 

• *•••• 

721 CTATGAAGTAAATGGGGTAGATGGCTATGACTAC^GCC 780 

207 YEVNGVDGYDYSRGQLIEDV 226 

781 GGAAC ATAC CTTTGAAGAGATTAAACC ATTATATGAAC ATCTTCATGC CTATGTGAGGGC 840 

227 EHTFEEIKPLYEHLHAYVRA 246 



841 AAAGTTGATGAATGCCTATCCTTCCTATATCAGTC^ 900 
247 KLMNAY P S Y I'S P I GC L PAHL 266 



FIG. 32B 



901 GCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCTTTGACAGTTC 960 
267 LGDM. WGRFWTNLYS LTVPFG 286 



961 ACAGAAACCAAACATAGATGTTACTGATGCAATGGT^ 1020 

287 QKPNIDVTDAMVDQAWDAQR 306 

1021 AATATTCAAGGAGGCCGAGAAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGG 1080 

307 IFKEAEKFFVSVGLPNMTQG 326 

1081 ATTCTGGGAAAATTCCATGCTAACGGACCCAGGAAATGTTCAGAAAGCAGTCTGCCATCC 1140 

327 FWENS. MLTDPGNVQKAVCHP 346 

1141 CACAGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATGTGCACAAAGGTGACAAT 1200 

347 TAWDLGKGDFRI LMCTKVTM 366 



1201 GGACGACTTCCTGACAGCTCATCATGAGATGGGGCATATCCAGTATGATATGGC ATATGC 1260 
367 DDFLTAHHEMGHIQYDMAYA 386 



1261 TGCACAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGA 1320 

387 AQ PFLLRNGANEGFHEAVGE 406 

1321 AATCATGTCACTTTCTGCAGCCACACCTAAGCATTTAAAATCCATTGGTCTTCTGTCACC 1380 

407 IMSLSAATPKHLKSIGLLSP 426 

1381 CGATTTTCAAGAAGACAATGAAACAGAAATAAACTTCCTGCTCAAACAAGCACTCACGAT 1440 

427 DFQEDNETEINFLLKQALTI 446 

1441 TGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGGTGGATGGTCTTTAAAGG 1500 

447 VGTLPFTYMLEKWRWMVFKG 466 



1501 GGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGG 1560 

467 EI PKDQWMKKWWEMKREIVG 486 

• 

1561 GGTGGTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGT 1620 

487 VVEPVPHDETYCDPASLFHV 506 

1621 TTCTAATGATTACTCATTCATTCGATATTACACAAGGACCCTTTACCAATTCCA 1680 

507 SNDYSFIRYYTRTLYQFQFQ 526 



1681 AGAAGCACTTTGTCAAGCAGCTAAACATGAAGGCCCTCTGCACAAATGTGACATCTCAAA 1740 
527 EALCQAAKHEGPLHKCDISN 546 



FIG. 32C 



1741 CTCTAC AGAAGCTGGAC AGAAACTGTTCAATATGCTGAGGCTTGGAAAATC AGAACCCTG 1800 

547 STEAGQKLFNMLRLGKSEPW 566 

1801 GACCCTAGCATTGGAAAATGTTGTAGGAGCAAAGAACATGAATGTAAGGCCACTGCTCAA 1860 

567 TLAL ENVVGAKNMNVRPLLN 586 

1861 CT AC TTTGAGC C C TT ATTT AC C TGGC TGAAAGACC AGAAC AAGAATTC TTTTGTGGGATG 1920 

587 YFEPLFTWLKDQNKNSFVGW 606 

1921 GAGTACCGACTGGAGTCCATATGCAGACCAAAGCATCAAAGTGAGGATAAGCCTAAAATC 1980 

607 STDWSPYADQSIKVRISLKS 626 

1981 AGCTCTTGGAGATAAAGCATATGAATGGAACGACAATGAAATGTACCTGTTCCGATCATC 2040 

627 ALGDKAYEWNDNEMYLFRSS 646 

• • • • • • 

2041 TGTTGC ATATGCTATGAGGC AGTACTTTTTAAAAGTAAAAAATC AGATGATTCTTTTTGG 2100 

647 VAYAMRQYFLKVKNQMILFG 666 

2101 GGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAAGAATCTCCTTTAATTTCTTTGTCAC 2160 

667 EEDVRVANLKPRI S FNFFVT 686 

2161 TGCACCTAAAAATGTGTCTGATATCATTCCTAGAACTGAAGTTGAAAAGGCCATCAGGAT 2220 

687 APKNVSDI I PRTEVEKAIRM 706 

2221 GTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGG 2280 

707 SRSRINDAFRLNDNSLEFLG 726 

2281 GATACAGCCAACACTTGGACCTCCTAACCAGCCCCCTGTTTCCATATGGCTGATTGTTTT 2340 

727 IQPTLGPPNQPPVSIWLIVF 746 

2341 TGGAGTTGTGATGGGAGTGATAGTGGTTGGCATTGTCATCCTGATCTTCACTGGGATCAG 2400 

747 GVVMGVIVVGIVILIFTGIR 766 

• • • • • • 

2401 AGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAGAAAATCCTTATGCCTCCATCGATAT 2460 

767 DRKKKNKARSGE NPYAS IDI 786 

2461 TAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAGACCTCCTTTTA 2520 

787 SKGENNPGFQNTDDVQTSF* 806 

■ a • • « • 

2521 GAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGTTGTATGTAAATGTTAA 2580 

2581 TATAGAAAATATAAGATGATAAAGATATCATTAAATGTCAAAACTATGACTCTGTTCAGA 2640 



FIG. 32D 

• • • * • • 

2641 AAAAAAATTGTCCAAAGACAACATGGCCAAGGAGAGAGCATCTTCATTGACATTGCTTTC 2700 

• > • • • • m 

2701 AGTATTTATTTCTGTCTCTGGATTTGACTTCTGTTCTGTTTCTTAATAAGGATTTTGTAT 2760 
2761 TAGAGTATATTAGGGAAAGTGTGT ATTTGGTCTCAC AGGCTGTTCAGGGATAATCTAAAT 2820 

2821 GTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGA 2880 

• ••••• 

2881 GATCTTGTATGGAATATGGATGGATCACTTGTAAGGACAGTGCCTGGGAACTGGTGTAGC 2940 

• ••••• 

2941 TGCAAGGATTGAGAATGGCATGCATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGA 3000 

• • • • * • 

3001 CATGCTTTCTTCACAGTAACTCAGTTCAAGTACTATGGTGATTTGCCTACAGTGATGTTT 3060 

• ••••• 

3061 GGAATCGATCATGCTTTCTTCAAGGTGACAGGTCTAAAGAGAGAAGAATCCAGGGAACAG 3120 

3121 GTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAACATCTCCCTGACAACACAA 3180 

3181 AACTAGAGCCAGGGGCCTCCGTGAACTCCCAGAGCATGCCTGATAGAAACTCATTTCTAC 3240 

3241 TGTTCTCTAACTGTGGAGTGAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGT 3300 

3301 ACCCAGTCTCTTAAATCTTTTGTATTTGCT 3360 

3361 CAGACACTCAATAAATGCTAGATTTACACACTCAAAAAAAAAAAA 3405 



FIG. 33A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60 
1MHLIDYLLLLLVGLLALSHG 20 



6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120 
21QLHVEHDGESCSNSSHQQIL 40 



121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTCCCTTC 180 

41ETGEGS PSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61RFYYLIASETPGKNIFFSPL 80 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCOGCAGCCAG 300 

81SISAAYAMLSLGACSHSRSQ 100 

301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360 

101 I LEGLGFNLTELSESDVHRG 120 



361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420 
121 FQHLLHTLNLPGHGLETRVG 140 



421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 480 
141 SALFLS HNLKFLAKFLNDTM 160 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 
161 AVYEAKLFHTNFYDTVGTIQ 180 



541 C TT ATC AAC GAC C ACGTC AAG AAGGAAACT C GAGGG AAGATTGTGG ATTTGGTC AGTGAG 600 
181 LINDHVKKETRGKIVDLVSE 200 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 
201 LKKDVLMVLVNYIYFKALWE 220 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 720 
221 KPFISSRTTPKDFYVDENTT 240 



721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780 
241 VRVPMMLQDQEHHWYLHDRY 260 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 LPCSVLRMDYKGDATVFFIL 280 



FIG. 33B 



841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 

281 PNQGKMREIEEVLT PEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960 

301 WNNLLRKRNFYKKL ELHLPK 320 

961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020 

321 FSISGSYVLDQILPRLGFTD 340 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACC AAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCACAAGGCC ACCTTGGACGTGGATGAGGCTGGC ACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200 

381 TTFAIKFFSAQTNRHILRFN 400 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTC 1260 

401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284 
421 VVDPTKP* 428 



FIG. 34A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 
1MHLIDYLLLLLVGLLALSHG 



6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 
21QLHVEHDGESCSNSSHQQIL 



121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCC AATGCTGACTTTGCCTTC 
41ETGEGSPSLKIAPANADFAF 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 
61RFYYLIASETPGKNIFFSPL 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCAC ACAGCCGCAGCCAG 
81 SI SAAYAMLSLGACSHSRSQ 



301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 
101 ILEGLGFNLTELSESDVHRG 



361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 
121 FQHLLHTLNLPGHGLETRVG 



421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 
141 SALFLSHNLKFLAKFLNDTM 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 

161 AVYEAKLFHTNFYDTVGTIQ 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 

181 LINDHVKKETRGKIVDLVSE 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 
201 LKKDVLMVLVNYIYFKALWE 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTTTATGTTGATGAGAACACAACA 

221 KPFI SSRTTPKDFYVDENTT 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 

241 VRVPMMLQDQEHHWYLHDRY 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 
261. LPCSVLRMDYKGDATVFFIL 



FIG. 34B 



• ••••• 

841 C C TAAC C AAGGC AAAATGAGGGAGATTGAAGAGGTTCTGAC TC C AGAGATGCTAATGAGG 900 
281 PNQGKMREIEEVLTPEMLMR 300 



901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960 
301 WNNLLRKRNFYKKLELHLPK 320 



961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020 

321 FSISGSYVLDQILPRLGFTD 340 

• ••••• 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCAC AAGGCC ACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 



• • • • • 

1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCC AGACC AATCGCCACATCCTGCGATTCAAC 1200 

381 TTFAIKFFSAQTNRHILRFN 400 

• ••••• 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260 

401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284 
421 VVDPTKP* 428 



FIG. 35A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 
1MHLIDYLLLLLVGLLALSHG 



6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 
21QLHVEHDGESCSNSSHQQIL 



121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 
41 ETGEGSPSLKIAPANADFAF 



181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 
61RFYYLIASETPGKNIFFSPL 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 
81 SI SAAYAMLSLGACSHSRSQ 



301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 

101 ILEGLGFNLTELSESDVHRG 

361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 

121 FQHLLHTLNLPGHGLETRVG 



421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 
141 SALFLSHNLKFLAKFLNDTM 



481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 

161 AVYEAKLFHTNFYDTVGTIQ 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGCGAG 

181 LINDHVKKETRGKIVDLVSE 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 
201 LKKDVLMVLVNYIYFKALWE 



661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 

221 KPF I S SRTT PKDFYVDENTT 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 

241 VRVPMMLQDQEHHWYLHDRY 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTC' 
261 LPCSVLRMDYKGDATVFF I L 



FIG. 35B 



841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960 

301 WNNLLRKRNFYKKLELHLPK 320 



961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020 
321 FSISGSYVLDQILPRLGFTD 340 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCAC AAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGC AGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 



1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200 
381 TTFAIKFFSAQTNRHILRFN 400 



1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260 

401 RPFLVVIFSTSTQSVLFLGK 420 

1261 GTCGTCGACCCCACGAAACCATAG 1284 

421 VVDPTKP* 428 



FIG. 36A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 
1MHLIDYLLLLLVGLLALSHG 



6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 
21QLHVEHDGESCSNSSHQQIL 

121 GAGACAGGTGAGGGCTCC CCCAGC CTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 
41 ETGEGS PSLKIAPANADFAF 



181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 
61RFYYLIASETPGKNIFFSPL 



241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 

81 SISAAYAMLSLGACSHSRSQ 

• ••••• 

301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 

101 ILEGLGFNLTELSESDVHRG 

361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 

121 FQHLLHTLNL PGHGLETRVG 

421 AGTGCTCTGTTCCTGAGCCACAACCTGAAGTTCCTTGCAAAATTCCTGAATGACACCATG 

141 SALFLSHNLKFLAKFLNDTM 

481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 

161 AVYEAKLFHTNFYDTVGTIQ 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATO 

181 LINDHVKKETRGKIVDLVSE 



601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCC 

201 LKKDVLMVLVNYIYFKALWE 

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTNTATGTTGATGAGAACAC 

221 KPFI SSRTTPKDFYVDENTT 



721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 
241 VRVPMMLQDQ EHHWYLHDRY 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 
261 LPCSVLRMDYKGDATVFFIL 



FIG. 36B 



• a • * • • 

841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCC AGAGATGCTAATGAGG 900 

281 PNQGKMREIEEVLTPEMLMR 300 

901 TGGAACAACTTGTOGCGGAAGAGGAATTTTTACAA 960 

301 WNNLLRKRNFYKKLELHLPK 320 

• ••••• 

961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020 

321 FSISGSYVLDQILPRLGFTD 340 

• ••••• 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 



1141 ACC ACGTTCGCGATCAAATTCTTCTCTGCCCAGACC AATCGCCACATCCTGCGATTCAAC 1200 

381 TTFAIKFFSAQTNRHILRFN 400 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTG^ 1260 

401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284 
421 VVDPTKP* 428 



FIG. 37 A 



• • • • • 

1 C GCCC AAC C C AAGTTC AAAGGC TG AT AAGAGAG AAAATC TC ATGAGGAGGTTTT AGTCT A 6 0 



6 1 GGGAAAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGATGTCAAGCTCTTCCTG 120 
1 MSSSSW6 

• ••>•• 

121 GCTCCTTCTC AGCCTTGTTGCTGTAACTGCTGCTC AGTCCACCATTGAGGAACAGGCCAA 180 

7 LLLSLVAVTAAQSTIEEQAK 26 

181 GACATTTTTGGACAAGTTTAACCACGAAGCCGAAGACCTGTTCTATCAAAGTTCACTTGC 240 

27 TFLDKFNHEAEDLFYQS SLA 46 

241 TTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATGAATAATGCTGG 300 

47 SWNYNTNI TEENVQNMNNAG 66 

301 GGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACA 360 

67 DKWSAFLKEQSTLAQMYPLQ 86 

361 AGAAATTCAGAATCTCACAGTGAAGCTTCAGCTGCAGGCTCTTCAGCAAAATGGGTCTTC 420 

87 EIQNLTVKLQLQALQQNGSS 106 

• • • * • * 

421 AGTGCTCTCAGAAGACAAGAGCAAACGGTTGAACACAATTCTAAATAC AATGAGCACCAT 480 

107 VLSEDKSKRLNTILNTMSTI 126 

• • • • • • 

481 CTACAGTACTGGAAAAGTTTGTAACCCAGATAATCC ACAAGAATGCTTATTACTTGAACC 540 

127 YSTGKVCNPDNPQECLLLEP 146 



541 AGGTTTGAATGAAATAATGGCAAACAGTTTAGACTACAATGAGAGGCTCTGGGCTTGGGA 600 

147 GLNE IMANSLDYNERLWAWE 166 

• ••••• 

601 AAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTT 660 

167 SWRS EVGKQLRPLYEEYVVL 186 

661 GAAAAATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGA 720 

187 KNEMARANHYEDYGDYWRGD 206 

• ••••• 

721 CTATGAAGTAAATGGGGTAGATGGCTATGACTACAGCCGCGGCCAGTTGATTGAAGATGT 780 

207 YEVNGVDGYDYSRGQLIEDV 226 



781 GGAAC ATACCTTTGAAGAGATTAAACCATTATATGAACATCTTC ATGCCTATGTGAGGGC 840 
227 EHTFEEIKPLYEHLHAYVRA 246 



841 AAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGATGCCTCCCTGCTCATTT 900 
247 KLMNAYPSYISPIGCLPAHL 266 



FIG. 37B 



901 GCTTGGTGATATGTGGGGTAGATTTTGGAC AAATCTGTACTCTTTGACAGTTCCCTTTGG 960 

267 LGDMWGRFWTNLYSLTVPFG 286 

• • « • * • 

961 ACAGAAACCAAACATAGATGTTACTGATGCAATGGTGGACCAGGCCTGGGATGCACAGAG 1020 

287 QKPNIDVTDAMVDQAWDAQR 306 

• • • • • • 

1021 AATATTCAAGGAGGCCGAGAAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGG 1080 

307 IFKEAEKFFVSVGLPNMTQG 326 

1081 ATTCTGGGAAAATTCCATGCTAACGGACCCAGGAAATGTTCAGAAAGCAGTCTGCCATCC 1140 

327 FWENSMLTDPGNVQKAVCHP 346 

• • • • • • 

1141 CACAGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATGTGCAC AAAGGTGAC AAT 1200 

347 TAWDLGKGDFRI LMCTKVTM 366 

1201 GGACGACTTCCTGACAGCTCATCATGAGATGGGGC ATATCCAGTATGATATGGC ATATGC 1260 

367 DDFLTAHHEMGH I QYDMAYA 386 

1261 TGCACAACCTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGA 1320 

387 AQPFLLRNGANEGFHEAVGE 406 

1321 AATCATGTCACTTTCTGC AGCCACACCTAAGCATTTAAAATCCATTGGTCTTCTGTCACC 1380 

407 IMSLSAATPKHLKS IGLLSP 426 

• • • • • • 

1381 CGATTTTCAAGAAGACAATGAAACAGAAATAAACTTCCTGCTCAAACAAGCACTCACGAT 1440 

427 DFQEDNETEINFLLKQA LTI 446 

. • • • ■ • 

1441 TGTTGGGACTCTGCC ATTTACTT AC ATGTTAGAGAAGTGGAGGTGGATGGTCTTTAAAGG 1500 

447 VGTLPFTYMLEKWRWMVFKG 466 

. . • • • • 

1501 GGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGG 1560 

467 EIPKDQWMKKWWEMKREIVG 486 

• >•••• 

1561 GGTGGTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGT 1620 
487 VVEPVPHDETYCDPASLFHV 506 

1621 TTCTAATGATTACTCATTCATTCGATATTACAC AAGGACCCTTTACCAATTCCAGTTTCA 1680 
507 SNDYSFIRYYTRTLYQFQFQ 526 

• • • • • • 

1681 AGAAGC ACTTTGTCAAGCAGCTAAACATGAAGGCCCTCTGCACAAATGTGACATCTCAAA 1740 
527 EALCQAAKHEGPLHKCDISN 546 



FIG. 37C 



1741 CTCTAC AGAAGCTGGACAGAAACTGTTCAATATGCTGAGGCTTGGAAAATC AGAACCCTG 1800 

547 STEAGQKLFNMLRLGKSEPW 566 

1801 GACCCTAGCATTGGAAAATGTTGTAGGAGCAAAGAACATGAATGTAAGGCCACTGCTCAA 1860 

567 TLALENVVGAKNMNVRPLLN 586 

1861 CTACTTTGAGCCCTTATTTACCTGGCTGAAAGACCAGAACAAGAATTCTTTTGTGGGATG 1920 

587 YFEPLFTWLKDQNKNS FVGW 606 



1921 GAGTACCGACTGGAGTCC ATATGCAGACCAAAGCATCAAAGTGAGGATAAGCCTAAAATC 1980 

607 STDWSPYADQSIKVRISLKS 626 

• ••••• 

1981 AGCTCTTGGAGATAAAGC ATATGAATGGAACGACAATGAAATGTACCTGTTCCGATCATG 2040 

627 ALGDKAYEWNDNEMYLFRSS 646 



2041 TGTTGC ATATGCTATGAGGC AGTACTTTTT AAAAGT AAAAAATC AGATGATTCTTTTTGG 2100 
647 VAYAMRQYFLKVKNQMILFG 666 



• ••••• 

2101 GGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAAGAATCTCCTTTAATTTCTTTGTCAC 2160 

667 EEDVRVANLKPRI SFNFFVT 686 

2161 TGCACCTAAAAACGTGTCTGATATCATTCCTAGAACTGAAGTTGAAAAGGCCATCAGGAT 2220 

687 APKNVSDI I PRTEVEKAI RM 706 

2221 GTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGG 2280 

707 SRSRINDAFRLNDNSLEFLG 726 

2281 GATACAGCCAAC ACTTGGACCTCCTAACCAGCCCCCTGTTTCCATATGGCTGATTGTTTT 2340 

727 IQPTLGPPNQPPVSIWLIVF 746 

2341 TGGAGTTGTGATGGGAGTGATAGTGGTTGGCATTGTCATCCTGATCTTCACTGGGATCAG 2400 

747 GVVMGVIVVGIVILIFTGIR 766 



• • • • 

2401 AGATCGGAAGAAGAAAAATAAAGC AAGAAGTGGAGAAAATCC TT ATGCC TCCATCGATAT 2460 

767 DRKKKNKARSGENPYASIDI 786 

2461 TAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAGACCTCCTTTTA 2520 

787 SKGENNPGFQNTDDVQTSF* 806 

2521 GAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGT 2580 

• • • • • • 

2581 TATAGAAAATATAAGATGATAAAGATATCATTAAATGTCAAAACTATGACTCTGTTCAGA 2640 



FIG. 37D 

2641 AAAAAAATTGTCCAAAGACAACATGGCCAAGGAGAGAGCATCTTCATTGACATTGCTTTC 2700 

2701 AGTATTTATTTCTGTCTCTGGATTTGACTTCTGTTCTGTTTCTTAATAAGGATTTTGTAT 2760 

2761 TAGAGTATATTAGGGAAAGTGTGTATTTGGTCTCACAGGCTGTTCAGGGATAATCTAAAT 2820 

• ••••• 

2821 GTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGATATATCATTGGAGCAAGTGTTG 2880 

2881 GATCTTGTATGGAATATGGATGGATCACTTGTAAGGACAGTGCCTGGGAACTGGTGTAGC 2940 

2941 TGCAAGGATTGAGAATGGCATGCATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGA 3000 

3001 CATGCTTTCTTCAC AGTAACTCAGTTCAAGTACTATGGTGATTTGCCTACAGTGATGTTT 3060 

3061 GGAATCGATCATGCTTTCTTCAAGGTGACAGGTCTAAAGAGAGAAGAATCCAGGGAACAG 3120 

3121 GTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAAC ATCTCCCTGACAACACAA 3180 

3181 AACTAGAGCC AGGGGCCTCCGTGAACTCCC AGAGCATGCCTGATAGAAACTCATTTCTAC 3240 

3241 TGTTCTCTAACTGTGGAGTGAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGT 3300 

3301 ACCCAGTCTCTTAAATCTTTTGTATTTGCTCACAGTGTTTGAGCAGTGCTGAGCACAAAG 3360 

3361 CAGACACTC AATAAATGCTAGATTTACACACTCAAAAAAAAAAAA 3405 



FIG. 38A 



1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT 
1MFSP WKI SMF 



61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT 
21TASF SAD MLN 

• • • 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG 
41TFAQ SKC PQV 



181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG 
61PPFL WVL FVL 



241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG 

81VFCL HKS SCT 

• • • 
301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG 
101 AADL I L A CGL 

• • • 
361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC 
121 FDWL FGE TLC 

• • • 
421 CTGTACAGCA GCATCTGTTT CCTGATGCTG 
141 LYSS ICF LML 

• • • 
481 AAAACCATGT CCATGGGCCG GATGCGCGGC 
161 KTMS MGR MRG 

• • • 
541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA 
181 IWGC TLL LSS 

■ • • 
601 TACAGCGATG AGGGCCACAA CGTCACCGCT 

201 YSDE GHN VTA 



• • • 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC 
221 EVFT NML LNV 



• • • 
721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG 
241 TFCT MQI MQV 

• • • 
781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG 
261 EIQT ERR A T V 



• * • 

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC 
281 IC WL PFQ 1ST 



CTGTCTGTTN GTGAGGNCTC CGTGCCCACC 60 

LSVX EXS VPT 20 

GTCACCTTGC AAGGGCCCAC TCTTAACGGG 120 

VTLQ GPT LNG 40 

• • • 
GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 

EWLG WLN TIQ 60 



GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
ATLE NIF VLS 80 

• • • 

GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
VAEI YLG NLA 100 



CCCTTCTGGG CCATCACCAT CTCCAACAAC 360 
PFWA ITI SNN 120 



. • * 

CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420 
RVVN All SMN 140 



• • • 

GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
VSID RYL ALV 160 



GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
V R W A K L Y S L V 180 

• • • 

CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
PMLV FRT MKE 200 



• • • 

TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
CVIS YPS LIW 220 



GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
VGFL LPL SVI 240 



• • • 

CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780 
LRNN EMQ KFK 260 



CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
LVLV VLL LFI 280 



TTCCTGGATA CX3CTGCATCG CCTCGGCATC 900 
FLDT LHR LGI 300 



FIG. 38B 

901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GANGTAATCA CACAGATCGC CTCCTTCATG 960 
301LSSC QDE RII DVIT QIA SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321AYSN SCL NPL VYVI VGK RFR 340 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG NGGGCTGCAG GTCAGAACCC 1080 
341KKSW EVY QGV CQKX GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361IQME NSM GTXi RTSI SVE RQI 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320 

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC AC CTGAGCCA 1440 

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCCTGAGG 1680 

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC ACTTGCTGTA TATGCAGTAT 1800 

1801 TGAGCACTGT AGGCAAGACC CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 

. ■ • 

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 



FIG. 38C 



2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 



2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA 



2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG 



2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT 



2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG 



2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTC TAA TCGGTCTTGC 



2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT 



2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA 



2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG 



2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA 



2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT 



2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA 



2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC 



2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC 



2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940 



2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000 



3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060 



3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120 



3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180 



3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240 



AGGAAAAGAC 2160 



TGGCGGTGTG 2220 



GTCATTCCCA 2280 



CCATGTCTTC 2340 



CCAGAGGATC 2400 



GCAGGCCTGC 2460 



TGGTTTATTG 2520 



GTGTTCACCA 2580 



GCTGGTTGGA 2640 



AGAACCTGGA 2700 



AGCTAAAACC 2760 



TAGAACCTGG 2820 



TAGAACCTAG 2880 



FIG. 38D 



3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300 



3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360 



3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420 



3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 



3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540 



3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 



3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 



3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720 



3721 AAAAAAAAAA AAA 3733 
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FIG. 41A 

1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGQGCAGCCC AAAGCTCCTC 60 



• • • • • 

61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120 



121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180 
181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300 
1 MARAHWGCCPWL12 

301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC AC AAAGC CAC TGGACCTTGG AGGGCAGGAT 360 
13VLLC ACA WGH TKPL DLG GQD 32 

361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420 
33VRNC STN PPY LPVT VVN TTM 52 



421 TCACTCACAG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480 
53 S L T A LRQ Q M Q TQNL SAY IIP 72 

481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG ACGAGAGGCG TGCGTGGATT 540 
73GTDA HMN EYI GQHD ERR AWI 92 

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600 
93TGFT GSA GTA VVTM KKA AVW 112 

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660 
113 TDSR YWT QAE RQMD CNW ELH 132 

661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC TGCTGGAGGG 720 
133 KEVG TTP IVT WLLT EIP AGG 152 

• •••"* 
721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780 
153 RVGF DPF L L S IDTW ESY DLA 172 

781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 
173 LQGS NRQ LVS ITTN LVD LVW 192 

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900 
193GSER PPV PNQ PIYA LQE AFT 212 

901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 
213 GSTW QEK VSG VRSQ MQK HQK 232 



FIG. 41B 



961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT 
233 VPTA V L L SAL 



1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC 
253 ASDI PYN PFF 



1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT 
273 IRLF ANK SRF 



1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC 
293 SCTG PMC VQI 

1201 CAGGC CTACT CATTGGGAGA TGTGAGGATC 
313 QAYS L G D VRI 



• • • 

1261 ATCTATGAAA TGATACCAAG GGAGAAACTC 
333 IYEM IPR E K L 



1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG 
353 TKAV KNS KEQ 



1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG 
373 AVAV IRY LVW 



• • • 

1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC 
393 DEFS GAE IVD 



1501 GGACCCAGTT TTGAAAC CAT CTCTGCTAGT 
413 GPSF ETI SAS 



1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC 
433 PTKE L N R K L S 



♦ • • 
1621 GGGCAGTACT GGGACGGGAC CACAGACATC 

453 GQYW DGT TDI 

1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG 
473 AFQK EAY TRV 

1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG 
493 IFPA ATS GRM 

• • • 
1801 GATGCTGGTC TCAATTATGG TCATGGGACA 

513 D AGL NYG HGT 



GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020 
EETA W L F N L R 252 



TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
YSYT L L T DSS 272 



AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
SSET L S Y L N S 292 



GAGGATTACA GCCAAGTTCG TGACAGCATC 1200 
EDYS QVR DSI 312 



• • • 

TGGATTGGGA C C AGCT AT AC CATGTATGGG 1260 
WIGT SYT MYG 332 



GTGACAGACA CCTACTCCCC AGTGATGATG 1320 
VTDT YSP VMM 352 



• • • 

GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380 
ALLK ASH VRD 372 



• • • 

CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440 
L E K N VPK GTV 392 



• • • 

AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500 
KFRG EEQ FSS 412 



GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560 
G L N A ALA HYS 432 



• • • 

TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620 
SDEM YLL DSG 452 



ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 
TRTV HWG TPS 472 



• • • 

CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 
LIGN IDL SRL 492 



• • • 

GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 
VEAF ARR ALW 512 



GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
GHGI GNF LCV 532 



FIG. 41C 

1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920 
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980 
553 SIEP GYY KDG EFGI RLE DVA 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2040 
573 L V V E AKT KYP GELP D L V VSF 592 

2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCCGAGCA TCTCCAGTAC 2100 
593 VPYD R N L IDV S L L S PEH L Q Y 612 

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 
613LNRY YQT IRE KVGP E L Q RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220 
633 LLEE FEW L Q Q HTEP L A A RAP 652 

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 2280 
653 DTAS WAS VLV VSTL AIL GWS 672 

2281 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2340 
673 V * 673 

2341 GCTCCCCTCA CCCTGCACTG AACATACCCC AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 2400 
2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC TATGGAGAAG GTCCCAGGCC CCAGGAAACA 2460 
2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC CTGCACCCCG GGGTTGTATA CCACACCCTG 2520 
2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 2580 
2581 CAGTAGGCCT AACCTATAAC CTAACACAGA CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640 

2641 AAGCCCCAAG AAAACAATGC CCCTACCACC CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700 

• * • • 

2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760 

2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820 

2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880 



2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940 



FIG. 41D 

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060 
3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120 
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300 
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360 
3361 CGTTAGCCAC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420 
3421 GCCTGTGG 3428 



FIG. 42A 



1 CTGTGCATGG CATCATCCTG GCCCCCTCTA GAGCTCCAAT CCTCCAACCA GAGCCAGCTC 60 
1 MA SSW PPL ELQS SNQ SQL 18 

61 TTCCCTCAAA ATGCTACGGC CTGTGACAAT GCTCCAGAAG CCTGGGACCT GCTGCACAGA 120 
19FPQN ATA CDN APEA WDL LHR 38 

121 GTGCTGCCGA CATTTATCAT CTCCATCTGT TTCTTCGGCC TCCTAGGGAA CCTTTTTGTC 180 
39VLPT FII SIC FFGL L G N LFV 58 



181 CTGTTGGTCT TCCTCCTGCC CCGGCGGCAA CTGAACGTGG CAGAAATCTA CCTGGCCAAC 240 
59LLVF LLP RRQ LNVA EIY LAN 78 



241 CTGGCAGCCT CTGATCTGGT GTTTGTCTTG GGCTTGCCCT TCTGGGCAGA GAATATCTGG 300 
79LAAS DLV FVL GLPF WAE NIW 98 



301 AAC CAGTTT A ACTGGCCTTT CGGAGCCCTC CTCTGCCGTG TCATCAACGG GGTCATCAAG 360 
99NQFN WPF GAL LCRV ING VIK 118 

361 GCCAATTTGT TCATCAGCAT CTTCCTGGTG GTGGCCATCA GCCAGGACCG CTACCGCGTG 420 
119 ANLF ISI FLV VAIS QDR YRV 138 



• • • • * 

421 CTGGTGCACC CTATGGCCAG CGGAAGGCAG CAGCGGCGGA GGCAGGCCCG GGTCACCTGC 480 
139LVHP MAS GRQ QRRR QAR VTC 158 



481 GTGCTCATCT GGGTTGTGGG GGGCCTCTTG AGCATCCCCA CATTCCTGCT GCGATCCATC 540 
159VLIW VVG GLL SIPT FLL RSI 178 

541 CAAGCCGTCC CAGATCTGAA CATCACCGCC TGCATCCTGC TCCTCCCCCA TGAGGCCTGG 600 
179 Q A V P DLN I T A CILL LPH EAW 198 

601 CACTTTGCAA GGATTGTGGA GTTAAATATT CTGGGTTTCC TCCTACCACT GGCTGCGATC 660 
199HFAR IVE L N I LGFL LPL A A I 218 

661 GTCTTCTTCA ACTACCACAT CCTGGCCTCC CTGCGAACGC GGGAGGAGGT CAGCAGGACA 720 
219 V F F N YHI LAS LRTR EEV SRT 238 

721 AGAGTGCAGG GGCCGAAGGA TAGCAAGACC ACAGCGCTGA TCCTCACGCT CGTGGTTGCC 780 
239 R V Q G PKD SKT TALI LTL VVA 258 

781 TTCCTGGTCT GCTGGGCCCC TTACCACTTC TTTGCCTTCC TGGAATTCTT ATTCCAGGTG 840 
259 FLVC WAP YHF FAFL EFL FQV 278 

841 CAAGCAGTCC GAGGCTGCTT TTGGGAGGAC TTCATTGACC TGGGCCTGCA ATTGGCCAAC 900 
279 QAVR GCF WED FIDL GLQ LAN 298 



FIG. 42B 



• • • 

901 TTCTTTGCCT TCACTAACAG CTCCCTGAAT 
299 FFAF TNS SLN 

961 TTCAGGACCA AGGTCTGGGA ACTTTATAAA 
319 FRTK VWE LYK 



1021 TCTTCATCCC ATAGGAAAGA AATCTTCCAA 
339 SSSH RKE IF Q 



♦ 

CCAGTAATTT ATGTCTTTGT GGGCCGGCTC 960 
PVIY VFV GRI- 318 



• • • 

CAATGCACCC CTAAAAGTCT TGCTCCAATA 1020 
QCTP KSL API 338 



• * 
CTTTTCTGGC GGAATTAAAA CAGCATTGAA 1080 
L F W R N * 353 



1081 CC 1082 



FIG. 43A 

1 ATGTTCTCTC CCTGGAAGAT ATCAATGTTT CTGTCTGTTC GTGAGGCCTC CGTGCCCACC 60 
1MFSP WKI SMF LSVR E A S VPT 20 

61 ACGGCCTCTT TCAGCGCCGA CATGCTCAAT GTCACCTTGC AAGGGCCCAC TCTTAACX3GG 120 
21TASF SAD MLN VTLQ GPT LNG 40 

* 

121 ACCTTTGCCC AGAGCAAATG CCCCCAAGTG GAGTGGCTGG GCTGGCTCAA CACCATCCAG 180 
41TFAQ SKC PQV EWLG WLN TIQ 60 

181 CCCCCCTTCC TCTGGGTGCT GTTCGTGCTG GCCACCCTAG AGAACATCTT TGTCCTCAGC 240 
61PPFL WVL FVL A T L E NIF VLS 80 



241 GTCTTCTGCC TGCACAAGAG CAGCTGCACG GTGGCAGAGA TCTACCTGGG GAACCTGGCC 300 
81VFCL HKS SCT VAEI YLG NLA 100 

301 GCAGCAGACC TGATCCTGGC CTGCGGGCTG CCCTTCTGGG CCATCACCAT CTCCAACAAC 360 
101 A A D L I L A C G L PFWA ITI SNN 120 



361 TTCGACTGGC TCTTTGGGGA GACGCTCTGC CGCGTGGTGA ATGCCATTAT CTCCATGAAC 420 
121 FDWL FGE TLC RVVN All SMN 140 



421 CTGTACAGCA GCATCTGTTT CCTGATGCTG GTGAGCATCG ACCGCTACCT GGCCCTGGTG 480 
141 LYSS ICF LML VSID RYL ALV 160 



481 AAAACCATGT CCATGGGCCG GATGCGCGGC GTGCGCTGGG CCAAGCTCTA CAGCTTGGTG 540 
161 KTMS MGR MRG VRWA KLY SLV 180 

• ••••• 

541 ATCTGGGGGT GTACGCTGCT CCTGAGCTCA CCCATGCTGG TGTTCCGGAC CATGAAGGAG 600 
181 IWGC TLL LSS PMLV FRT MKE 200 

. 

601 TACAGCGATG AGGGCCACAA CGTCACCGCT TGTGTCATCA GCTACCCATC CCTCATCTGG 660 
201 YSDE GHN VTA CVIS YPS LIW 220 

661 GAAGTGTTCA CCAACATGCT CCTGAATGTC GTGGGCTTCC TGCTGCCCCT GAGTGTCATC 720 
221 EVFT NML LNV VGFL LPL SVI 240 

721 ACCTTCTGCA CGATGCAGAT CATGCAGGTG CTGCGGAACA ACGAGATGCA GAAGTTCAAG 780 
241 TFCT MQI MQV LRNN EMQ KFK 260 

781 GAGATCCAGA CGGAGAGGAG GGCCACGGTG CTAGTCCTGG TTGTGCTGCT GCTATTCATC 840 
261 EIQT ERR A T V LVLV VLL LFI 280 

841 ATCTGCTGGC TGCCCTTCCA GATCAGCACC TTCCTGGATA CGCTGCATCG CCTCGGCATC 900 
281 ICWL PFQ 1ST FLDT LHR LGI 300 



FIG. 43B 



901 CTCTCCAGCT GCCAGGACGA GCGCATCATC GATGTAATCA CACAGATCGC CTCCTTCATG 960 
301 LSSC QDE RII DVIT Q I A SFM 320 

961 GCCTACAGCA ACAGCTGCCT CAACCCACTG GTGTACGTGA TCGTGGGCAA GCGCTTCCGA 1020 
321 AYSN SCL NPL VYVI VGK RFR 340 

• ••••• 

1021 AAGAAGTCTT GGGAGGTGTA CCAGGGAGTG TGCCAGAAAG GGGGCTGCAG GTCAGAACCC 1080 
341 KKSW EVY QGV CQKG GCR SEP 360 

1081 ATTCAGATGG AGAACTCCAT GGGCACACTG CGGACCTCCA TCTCCGTGGA ACGCCAGATT 1140 
361 IQME NSM GTL RTSI SVE R Q I 380 

1141 CACAAACTGC AGGACTGGGC AGGGAGCAGA CAGTGAGCAA ACGCCAGCAG GGCTGCTGTG 1200 
381 HKLQ DWA GSR Q * 391 

1201 AATTTGTGTA AGGATTGAGG GACAGTTGCT TTTCAGCATG GGCCCAGGAA TGCCAAGGAG 1260 

1261 ACATCTATGC ACGACCTTGG GAAATGAGTT GATGTCTCCG GTAAAACACC GGAGACTAAT 1320 

1321 TCCTGCCCTG CCCAATTTTG CAGGGAGCAT GGCTGTGAGG ATGGGGTGAA CTCACGCACA 1380 

1381 GCCAAGGACT CCAAAATCAC AACAGCATTA CTGTTCTTAT TTGCTGCCAC ACCTGAGCCA 1440 

1441 GCCTGCTCCT TCCCAGGAGT GGAGGAGGCC TGGGGGGAGG GAGAGGAGTG ACTGAGCTTC 1500 

1501 CCTCCCGTGT GTTCTCCGTC CCTGCCCCAG CAAGACAACT TAGATCTCCA GGAGAACTGC 1560 

1561 CATCCAGCTT TGGTGCAATG GCTGAGTGCA CAAGTGAGTT GTTGCCCTGG GTTTCTTTAA 1620 

1621 TCTATTCAGC TAGAACTTTG AAGGACAATT TCTTGCATTA ATAAAGGTTA AGCCC TGAGG 1680 

1681 GGTCCCTGAT AACAACCTGG AGACCAGGAT TTTATGGCTC CCCTCACTGA TGGACAAGGA 1740 

1741 GGTCTGTGCC AAAGAAGAAT CCAATAAGCA CATATTGAGC AC TTGCTGT A TATGCAGTAT 1800 

• ••••• 

1801 TGAGCACTGT AGGCAAG AC C CAAGAAAGAG AAGGAGCCAT CTCCATCTTG AAGGAACTCA 1860 

1861 AAGACTCAAG TGGGAACGAC TGGGCACTGC CACCACCAGA AAGCTGTTCG ACGAGACGGT 1920 

1921 CGAGCAGGGT GCTGTGGGTG ATATGGACAG CAGAAGGGGG AGACCAAGGT TCCAGCTCAA 1980 

1981 CCAATAACTA TTGCACAACC ACCTGTCCCT GCCTCAGTTC CCTTTTATGT AACATGAAGT 2040 



FIG. 43C 

2041 CGTTGTGAGG GTTAAAGGCA GTAACAGGTA TAAAGTACTT AGAAAAGCAA AGGGTGCTAC 2100 
2101 GTACATGTGA GGCATCATTA CGCAGACGTA ACTGGGATAT GTTTACTATA AGGAAAAGAC 2160 
2161 ACTGAGGTCT AGAAATAGCT CCGTGGAGCA GAATCAGTAT TGGGAGCCGG TGGCGGTGTG 2220 
2221 AAGCACCAGT GTCTGGCACA CAGTAGGTGC TCATTGGCTC CCTTCCACCT GTCATTCCCA 2280 
2281 CCACCCTGAG GCCCCAACCG CCACACACAC AGGAGCATTT GGAGAGAAGG CCATGTCTTC 2340 
2341 AAAGTCTGAT TTGTGATGAG GCAGAGGAAG ATATTTCTAA TCGGTCTTGC CCAGAGGATC 2400 
2401 ACAGTGCTGA GACCCCCCAC CACCAGCCGG TACCTGGGAA GGGGGAGAGT GCAGGCCTGC 2460 
2461 TCAGGGACTG TTCCTGTCTC AGCAACCAAG GGATTGTTCC TGTCAATCAA TGGTTTATTG 2520 
2521 GAAGGTGGCC CAGTATGAGC CCTAGAAGAG TGTGAAAAGG AATGGCAATG GTGTTCACCA 2580 
2581 TCGGCAGTGC CAGGGCAGCA CTCATTCACT TGATAAATGA ATATTTATTA GCTGGTTGGA 2640 
2641 GAGCTAGAAC CTGGAGAGCT AGAACCTGGA GAACTAGAAC CTGGAGGGCT AGAACCTGGA 2700 
2701 GAGGCTAGAA CCAAGAAGGG CTAGAACCTG GAGGGGCTAG AACCTAGAGA AGCTAAAACC 2760 
2761 TGAGCTAGAA GCTGGAGGAC TAGAACCTGG AGGGCTGGAA TCTGAAGGGC TAGAACCTGG 2820 
2821 AGGGCTGGAA TCTGGAGAGC TAGAACCTGG AGGGCTAGAA CCTGGAGGGC TAGAACCTAG 2880 
2881 AAGGGCTAGA ACCTGGAGGG CTGGAATCTG GAGAGCTAGA ACCTGGAGGG CTAGAACCTG 2940 

2941 GAGGGCTAGA ACCTAGAAGG GCTAGAACCT GGAGGGCTAG AACCTGGCAG GTTAGAACCT 3000 

• • * * 

3001 AGAAGGGCTA GAACCTGGAG AGCCAGAACC TGGAGGGCTA GAACCTGGAA GGGCTAGAAC 3060 

3061 CTGTAGAGCT AGAACATGGA GAGCTAGAAC CCGGCAGGCT AGAACCTGGC AAGCTAGAAC 3120 

3121 CTGGAGGGAA TGAACCTGGA GGGCTAGAAC CTGGAGAATG AGAAAAATTT ACATGGCAAA 3180 

3181 GAGCCCATAA ATCCTGACCA ATCCAACTCT GAATTTTAAA GCAAAAGCGT GAAAAAAAAG 3240 



FIG. 43D 

• ••••• 

3241 ATTCCCTCCT TACCCCCAAC CCACTCTTTT TTCCCACCAC CCACTCTCCT CTGCCTCAGT 3300 

• ••••• 
3301 AAGTATCTGG AGGAAGAAAA CAGGTGAAAG AAGAAGTAAA AACCATTTAG TATTAGTATT 3360 

• ••••• 
3361 AGAATGAAGT CAAACTGTGC CACACATGGT GAATGAAAAA AAAAAAAAAG AGGCTGTGTT 3420 

• •«••• 
3421 TTGTCACACA GGGCAGTCAT TCAGCACCAG AGCACGTGAT GGTCTGAGAC TCTCTTAGGA 3480 

• ••••• 
3481 GCAGAGCTCT GCCGCAATGG CCATGTGGGG ATCCACACCT GGTCTGAGGG GCAACTGAGT 3540 

3541 CTGCGGGAGA AGAGCGGCCC TATGCATGGT GTAGATGCCC TGATAAAGAA CATCTGTCCT 3600 

• ••••• 
3601 GTGAAAGACT CAATGAGCTG TTATGTTGTA AACAGGAAGC ATTTCACATC CAAACGAGAA 3660 

• ••••• 
3661 AATCATGTAA ACATGTGTCT TTTCTGTAGA GCATAATAAA TGGATGAGGT TTTTGCAAAA 3720 

3721 AAAAAAAAAA AAA 3733 



FIG. 44A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 60 

1MHLIDYLLLLLVGLLALSHG 20 

61 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120 

21QLHVEHDGESCSNSSHQQIL 40 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41ETGEGSPSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCGCTG 240 

61RFYYLIASETPGKNIFFSPL 80 

241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300 

81 SI SAAYAMLSLGACSHSRSQ 100 

301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360 

101ILEGLGFNLTELSESDVHRG 120 



361 TTCCAGCACCTCCTGCACACTCTCAACCTCCCCGGCCATGGGCTGGAAACACGCGTGGGC 420 

121FQHLLHTLNLPGHGLETRVG 140 

421 AGTGCT C TGTTC CTGAGC C AC AAC CTGAAGTTC C TTGC AAAATTCCTG AATGAC AC C ATG 480 

141SALFLSHNLKFLAKFLNDTM 160 

481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

161 AVYEAKLFHTNFY DTVGTIQ 180 

541 CTTATCAACGACC^CGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 

181LINDHVKKETRGKIVDLVSE 200 

601 CTCAAGAAGGACGTCTTGATG^TGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 660 

201LKKDVLMVLVNYIYFKALWE 220 

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 720 

221 KPFI SSRTTPKDFYVDENTT 240 

721 GTCCGGGTGCCCATGATGCTGCAGGACCAGGAGCATCACTGGTATCTTCATGACAGATAC 780 

241VRVPMMLQDQEHHWYLHDRY 260 

781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 

261LPCSVLRMDYKGDATVFFIL 280 



FIG. 44B 



» • • • • • 

841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 

281 PNQGKMREIEEVLT PEMLMR 300 

«••••• 

901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960 

301 WNNLLRKRNFYKKLELHLPK 320 



961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020 
321 FS I SGSYVLDQILPRLGFTD 340 



« • • • • • 

1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

«••••• 

1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 

• ••••* 

1141 ACG^CGTTCGCGATCAAATTCTTCTCTGCCCAGACCAATCGCCACATCCTGCGATTCAAC 1200 

381 TTFAIKFFSAQTNRHI LRFN 400 

..•••* 

1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACC AGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260 

401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284 
421 VVDPTKP* 428 



FIG. 45A 



1 ATGCATCTTATCGACTACCTGCTCCTCCTGCTGGTTGGACTACTGGCCCTTTCTCATGGC 6 0 

1MHLIDYLLLLLVGLLALSHG 20 

6 1 CAGCTGCACGTTGAGCATGATGGTGAGAGTTGCAGTAACAGCTCCCACCAGCAGATTCTG 120 

21QLHVEHDGESCSNSSHQQIL 40 

• ••••• 

121 GAGACAGGTGAGGGCTCCCCCAGCCTCAAGATAGCCCCTGCCAATGCTGACTTTGCCTTC 180 

41ETGEGSPSLKIAPANADFAF 60 

181 CGCTTCTACTACCTGATCGCTTCGGAGACCCCGGGGAAGAACATCTTTTTCTCCCCX5CTG 240 

61RFYYLIASETPGKNIFFSPL 80 

241 AGCATCTCGGCGGCCTACGCCATGCTTTCCCTGGGGGCCTGCTCACACAGCCGCAGCCAG 300 

81 S I SAAYAMLSLGACSHSRSQ 100 

• ••••• 

301 ATCCTTGAGGGCCTGGGCTTCAACCTCACCGAGCTGTCTGAGTCCGATGTCCATAGGGGC 360 

101 ILEGLGFNLTELSESDVHRG 120 

361 TTCCAGCACCTCCTGCAC ACTCTCAACCTCCCCGGCCATGGGCTGGAAACATGCGTGGGC 420 

121 FQHLLHTLNLPGHGLETCVG 140 

421 AGTGCTCTGTTC CTGAGC CACAACCTGAAGTTCCTTGCAAAATTCCTGAATGAC ACCATG 480 

141 SALFLSHNLKFLAKFLNDTM 160 

481 GCCGTCTATGAGGCTAAACTCTTCCACACCAACTTCTACGACACTGTGGGCACAATCCAG 540 

161 AVYEAKLFHTNFYDTVGTIQ 180 

541 CTTATCAACGACCACGTCAAGAAGGAAACTCGAGGGAAGATTGTGGATTTGGTCAGTGAG 600 

181 LINDHVKKETRGKIVDLVSE 200 

601 CTCAAGAAGGACGTCTTGATGGTGCTGGTGAATTACATTTACTTCAAAGCCCTGTGGGAG 66 0 

201 LKKDVLMVLVNY IYFKALWE 220 

• • • • • 

661 AAACCATTCATTTCCTCAAGGACCACTCCCAAAGACTTCTATGTTGATGAGAACACAACA 720 

221 KPFI SSRTTPKDFYVDENTT 240 

• ••••• 

721 GTCCGGGTGCCCATGATGCTGC AGGACCAGGAGCATCACTGGTATCTTCATGAC AGATAC 780 

241 VRVPMMLQDQEHHWYLHDRY 260 



781 TTGCCCTGCTCGGTGCTACGGATGGATTACAAAGGAGACGCAACCGTGTTTTTCATTCTC 840 
261 LPCSVLRMDYKGDATVFFIL 280 



FIG. 45B 



841 CCTAACCAAGGCAAAATGAGGGAGATTGAAGAGGTTCTGACTCCAGAGATGCTAATGAGG 900 
281 PNQGKMRE I EEVLTPEMLMR 300 



901 TGGAACAACTTGTTGCGGAAGAGGAATTTTTACAAGAAGCTAGAGTTGCATCTTCCCAAG 960 
301 WNNLLRKRNFYKKLELHLPK 320 



961 TTCTCCATTTCTGGCTCCTATGTATTAGATCAGATTTTGCCCAGGCTGGGCTTCACGGAT 1020 
321 FSISGSYVLDQILPRLGFTD 340 



1021 CTGTTCTCCAAGTGGGCTGACTTATCCGGCATCACCAAACAGCAAAAACTGGAGGCATCC 1080 

341 LFSKWADLSGITKQQKLEAS 360 

1081 AAAAGTTTCCACAAGGCCACCTTGGACGTGGATGAGGCTGGCACCGAGGCTGCAGCAGCC 1140 

361 KSFHKATLDVDEAGTEAAAA 380 

1141 ACCACGTTCGCGATCAAATTCTTCTCTGCCCAGACC AATCGCC ACATCCTGCGATTCAAC 1200 

381 TTFAIKFFSAQTNRHILRFN 400 



1201 CGGCCCTTCCTTGTGGTGATCTTTTCCACCAGCACCCAGAGTGTCCTCTTTCTGGGCAAG 1260 
401 RPFLVVIFSTSTQSVLFLGK 420 



1261 GTCGTCGACCCCACGAAACCATAG 1284 
421 VVDPTKP* 428 



FIG. 46A 



1 CACCCTATCC TACACTACTA GGAACTTGCA CAGTCCGCCT CGGGCAGCCC AAAGCTCCTC 60 



61 TGCCCACCCT GGCTCCCAAA ACCCTCCAAA ACAAAAGACC AGAAAAGCAC TCTCCACCCA 120 



121 GCAGCCAAAC GCCTCCTTCT TGACGCCAGC CCCCACCCTC TGTCTGCTCG AGCCCAGGAA 180 



181 AGGCCTGAAG GAACAGGCCG GGGAAGGAGC CCTCCCTCTC TCCCTTGTCC CTCCATCCAC 240 



241 CCAGCGCCGG CATCTGGAGA CCCTATGGCC CGGGCTCACT GGGGCTGCTG CCCCTGGCTG 300 
1 MA RAHW GCC PWL 12 

- 

301 GTCCTCCTCT GTGCTTGTGC CTGGGGCCAC ACAAAGCCAC TGGACCTTGG AGGGCAGGAT 360 
13VLLC ACA WGH TKPL D L G GQD 32 

361 GTGAGAAATT GTTCCACCAA CCCCCCTTAC CTTCCAGTTA CTGTGGTCAA TACCACAATG 420 
33VRNC STN PPY LPVT V V N TTM 52 

421 TCACTCACAG CCCTCCGCCA GCAGATGCAG ACCCAGAATC TCTCAGCCTA CATCATCCCA 480 
53SLTA L R Q QMQ TQNL SAY IIP 72 

481 GGCACAGATG CTCACATGAA CGAGTACATC GGCCAACATG ACGAGAGGCG TGCGTGGATT 540 
73GTDA HMN EYI GQHD ERR AWI 92 

541 ACAGGCTTTA CAGGGTCTGC AGGAACTGCA GTGGTGACTA TGAAGAAAGC AGCTGTCTGG 600 
93TGFT GSA GTA VVTM KKA AVW 112 

601 ACCGACAGTC GCTACTGGAC TCAGGCTGAG CGGCAAATGG ACTGTAATTG GGAGCTCCAT 660 
113 TDSR YWT Q A E RQMD CNW ELH 132 

661 AAGGAAGTTG GCACCACTCC TATTGTCACC TGGCTCCTCA CCGAGATTCC NGCTGGAGGG 720 
133 KEVG TTP IVT WLLT EIP AGG 152 

721 CGTGTGGGTT TTGACCCCTT CCTCTTGTCC ATTGACACCT GGGAGAGTTA TGATCTGGCC 780 
153 RVGF DPF LLS IDTW ESY DLA 172 

781 CTCCAAGGCT CTAACAGACA GCTGGTGTCC ATCACAACCA ATCTTGTGGA CCTGGTATGG 840 
173LQGS NRQ LVS ITTN LVD LVW 192 

841 GGATCAGAGA GGCCACCGGT TCCAAATCAA CCCATTTATG CCCTGCAGGA GGCATTCACA 900 
193 GSER PPV PNQ PIYA LQE AFT 212 



901 GGGAGCACTT GGCAGGAGAA AGTATCTGGC GTCCGAAGCC AGATGCAGAA GCATCAAAAG 960 
213 GSTW QEK VSG VRSQ MQK HQK 232 



FIG. 46B 



• ••••• 

961 GTCCCGACTG CCGTCCTTCT GTCGGCGCTT GAGGAGACGG CCTGGCTCTT CAACCTTCGA 1020 
233 VPTA V L L SAL EETA W L F NLR 252 



• ••••• 

1021 GCCAGTGACA TCCCCTATAA CCCCTTCTTC TATTCCTACA CGCTGCTCAC AGACTCTTCT 1080 
253 ASDI PYN PFF YSYT LLT DSS 272 



• ••••• 

1081 ATTAGGTTGT TTGCAAACAA GAGTCGCTTT AGCTCCGAAA CCTTGAGCTA TCTGAACTCC 1140 
273 IRLF ANK SRF SSET L S Y L N S 292 



• ••••• 

1141 AGTTGCACAG GCCCCATGTG TGTGCAAATC GAGGATTACA GCCAAGTTCG TGACAGCATC 1200 
293 SCTG PMC VQI EDYS QVR DSI 312 



• ••••• 

1201 CAGGCCTACT CATTGGGAGA TGTGAGGATC TGGATTGGGA CCAGCTATAC CATGTATGGG 1260 
313 QAYS L G D VRI WIGT SYT MYG 332 



• ••••• 

1261 ATCTATGAAA TGATACCAAG GGAGAAACTC GTGACAGACA CCTACTCCCC AGTGATGATG 1320 
333 IYEM IPR EKL VTDT YSP VMM 352 



• ••••• 

1321 ACCAAGGCAG TGAAGAACAG CAAGGAGCAG GCCCTCCTCA AGGCCAGCCA CGTGCGGGAC 1380 
353 TKAV KNS KEQ ALLK ASH VRD 372 



• • • • • • 

1381 GCTGTGGCTG TGATCCGGTA CTTGGTCTGG CTGGAGAAGA ACGTGCCCAA AGGCACAGTG 1440 
373 AVAV IRY L V W L E K N VPK GTV 392 



• ••••« 
1441 GATGAGTTTT CGGGGGCAGA GATCGTGGAC AAGTTCCGAG GAGAAGAACA GTTCTCCTCC 1500 

393 DEFS GAE IVD KFR.G EEQ FSS 412 

• ••••• 
1501 GGACCCAGTT TTGAAACCAT CTCTGCTAGT GGTTTGAATG CTGCCCTGGC CCACTACAGC 1560 

413 GPSF ETI SAS G L N A ALA HYS 432 



>•••«« 
1561 CCGACCAAGG AGCTGAACCG CAAGCTGTCC TCAGATGAGA TGTACCTGCT GGACTCTGGG 1620 
433 PTKE LNR KLS SDEM YLL DSG 452 



• ••••• 

1621 GGGCAGTACT GGGACGGGAC CACAGACATC ACCAGAACAG TCCACTGGGG CACCCCCTCT 1680 
453 GQYW DGT TDI TRTV HWG TPS 472 



• ••••• 

1681 GCCTTTCAGA AGGAGGCATA TACCCGTGTG CTGATAGGAA ATATTGACCT GTCCAGGCTC 1740 
473 AFQK EAY TRV LIGN IDL SRL 492 



1741 ATCTTTCCCG CTGCTACATC AGGGCGAATG GTGGAGGCCT TTGCCCGCAG AGCCTTGTGG 1800 
493 IFPA ATS GRM VEAF ARR ALW 512 



1801 GATGCTGGTC TCAATTATGG TCATGGGACA GGCCACGGCA TTGGCAACTT CCTGTGTGTG 1860 
513 DAGL NYG HGT GHGI GNF LCV 532 



FIG. 46C 



1861 CATGAGTGGC CAGTGGGATT CCAGTCCAAC AACATCGCTA TGGCCAAGGG CATGTTCACT 1920 
533 HEWP VGF QSN NIAM AKG MFT 552 

1921 TCCATTGAAC CTGGTTACTA TAAGGATGGA GAATTTGGGA TCCGTCTCGA AGATGTGGCT 1980 
553 SIEP GYY KDG EFGI RLE DVA 572 

1981 CTCGTGGTAG AAGCAAAGAC CAAGTACCCA GGGGAGCTAC CTGACCTTGT GGTATCATTT 2040 
573 L V V E AKT KYP GELP DLV VSF 592 

• 

2041 GTGCCCTATG ACCGGAACCT CATCGATGTC AGCCTGCTGT CTCCNGAGCA TCTCCAGTAC 2100 
593 VPYD R N It IDV S L L S PEH L Q Y 612 

2101 CTGAATCGCT ACTACCAGAC CATCCGGGAG AAGGTGGGTC CAGAGCTGCA GAGGCGCCAG 2160 
613 LNRY YQT IRE KVGP E L Q RRQ 632 

2161 CTACTAGAGG AGTTCGAGTG GCTTCAACAG CACACAGAGC CCCTGGCCGC CAGGGCCCCA 2220 
633 L L E E FEW L Q Q HTEP LAA RAP 652 

• 

2221 GACACCGCCT CCTGGGCCTC TGTGTTAGTG GTCTCCACCC TTGCCATCCT TGGCTGGAGT 2280 
653 DTAS WAS VLV VSTL AIL GWS 672 



2281 GTCTAGAGGC TCCAGACTCT CCTGTTAACC CTCCATCTAG ATGGGGGGCT CCCTTGCTTA 2340 



673 V * 



673 



2341 GCTCCCCTCA CCCTGCACTG AACATACCCC 



2401 CCTTTGCATT CATCCTCCTT CTCCAAGACC 



2461 CAGGGCTTCT TGGCCCCAGA TGGCACCTCC 



2521 GGCCCCTAAT CCCAGGCCCC GAAATAGGAA 



2581 CAGTAGGCCT AACCTATAAC CTAACACAGA 



2641 AAGCCCCAAG AAAACAATGC CCCTACCACC 



AAGAGCCCCT GCTGGCCCAT TGCCTAGAAA 



TATGGAGAAG GTCCCAGGCC CCAGGAAACA 



CTGCACCCCG GGGTTGTATA CCACACCCTG 



AGCCAGCTAG TCTCTTCTCT TCTGTGATCT 



CTGCTACAGC TGCTCCCCTC CCGCCAAACA 2640 



CAAGGGTGCC ATGGTCCCGG GAAAACCCAA 2700 



2701 CCTGTCACCG CGTGTTGGGC GTAACCAGAA CTGTTCCCCC CCACCAGGGC TTAAAAATCG 2760 



2761 CCCCCACTTT TTAACCATCG TCCATTAACC ACCTGGTGGG CATAGCCAGA GCTGTTCGAA 2820 



2821 CCCAGCCAGG GATGAAAAAT CAACCCCCGA CATGGAACCC ATGATTCCTA AACCCGGGGT 2880 



2400 



2460 



2520 



2580 



2881 AGGTTCCATG CCAAGTAACA GCAGAGGGAG TTAAGCCATA GGAATTTGGC TGTGGAGTAA 2940 



FIG. 46D 

2941 GAGGGAATGC GGTGAGGCAG TGTGGAATAT GACCCTACCA GAGGTTGGAG AACAAACTTG 3000 
3001 GGCAGCCGGA ACCCGTCACT ATTTTAGATT CCTGGCATTC GAGGAGCCCT TTGAACTTTC 3060 
3061 CAAAGTGCAG CCACAGCTAC AATGCTGTTA AATCCTCCCA CATTTCTTGG ATGCCCCTTC 3120 
3121 ACCTTGTGTG GACAGTGTCT GGTTTCCCCA TTTTACAGAC AGGAAAACTG AGCTTCAGAC 3180 
3181 AGGGGGTGGG CTTTGCCTAA GGACACACAA ATTTGGTTGG GAGTTGATGG GGCCAGATGA 3240 
3241 GCCAGCATTC CAGCTGTTTC ACCCTTCAGC AACATGCAGA GTCCCTGAGC CCACCTCCCA 3300 
3301 GCCCTCTCCT CATTCTCTGA ACCCACTGTG GTGAGAAGAA TTTGCTCCGG CCAAATTGGC 3360 
3361 CGTTAGC C AC CTGGGTCCAC ATCCTGCTAA GACGTTTAAA ACAGCCTAAC AAAGACACTT 3420 
3421 GCCTGTGG 3428 
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